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1.0 INTRODUCTION 


Analog charge transfer devices (CTD'sl offer unique 
advantages to signal processing systems, which often have large 
development costs, making it desirable to define those devices 
which can be developed for general system's use. Such devices 
are best identified and developed early to give system’s designers 
some interchangeable subsystem blocks , not requiring additional 
individual development for each new signal processing system. 

The objective of this work is to describe a discrete analog signal 
processing device with a reasonably broad system use and to imple- 
ment its design, fabrication, and testing. 

The "Operational Sampled Analog Processor" (OP~SAP) was 
selected ^ as the broadly attractive, general purpose vehicle 
and is functionally illustrated m Figure 1. Two sets of ANALOG 
data are presented to parallel analog delay means with the output 
taken as the sum of the true four-quadrant products. If one set 
of analog data are shifted by the other set of data, the resulting 
sequence of computations may be regarded as a discrete convolution 
function for relative motion in one direction; while relative 
motion in the other direction gives either a cross-correlation 
function for different sets of analog data or an autocorrelation 
function when the same data set is used in both channels. Trans- 
versal filters are a further example of a sequence of computations 
like that shown in Figure 1. 
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Any device capable of the preceding correlation/convolu- 
tion functions is also capable ofamore limited class of applications 
which may be called analog matrix arithmetic- An example is the 
analog vector-dot product m which there is one only ONE sum 
computation with the pairing of analog values to be multiplied 
together strictly specified before the two data sets are entered 
into the CTD OP-SAP . Consequently, the motion of one set of analog 
samples relative to the other is inconsequential for this limited 
subset of applications. Thus, although of academic interest for 
the sake of completeness, any analog matrix arithmetic device 
limited by its inability to move one set of analog samples relative 
to the other will not be given detailed consideration here because 
it fails to meet the objective of a broadly useable analog signal 
processing item. A further restriction on the present investigation 
of CTD correlation techniques is the format of the input data: 
both sets of data are purely analog in contrast to binary coded 
analog for one of the data sets. 

Within the boundaries prescribed by the above guidelines, 
all the essential constituents of a monolithic CTD analog-analog 
correlator/convolver chip have been investigated and designed on 
the basis of both experimental and computer model data with most 
of those elements subsequently incorporated into a hybrid model 
demonstrating the accuracy and feasibility of the preceding mono- 
lithic design effort. After the basic elements of the device are 
described, two attractive concepts for the bias of the multipliers 
are presented. Discussed next are some sources of deviations from 
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true analog-analog multiplication, such as harmonic distortion 
and direct .feedthrough of the input signals due to unbalances 
from array threshold nonuniformities, as well as corrective 
measures to' reduce the problems. ' Computer models illus- 
trate the design technique based on the desired fabrication process 
and performance specifications including some broadly limiting 
formulas relating performance to fabrication and operation para- 
meters. These formulas are tested against other published performanc 
data after a description of the previously delivered hardware. 

The summary and conclusions follow additional information on 
fixed pattern noise and multiplexing of analog-analog correlators . 
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'2.0 ELEMENTS OF THE DEVICE 

The basic device elements shown m Figure 1 are of two 
types: an array of multiplying means disposed between two 

analog delay means. Since substantial effort has previously 
been devoted to the needed analog delay means, only a brief 
comparison of the various attractive possibilities is now presented. 

2.1 Analog Delay Lines With Independent Parallel Nondestructive 
Readout (NDRO) 


A technique for NDRO in charge transfer devices is to 
sense the displacement of majority carrier charge which occurs when 
miniority carriers m a CCD move beneath a floating CCD electrode. 
Figure 2 illustrates the use of a floating clock electrode technique 
to convert the signal charge q g to a voltage v^ on the gate of a MOS 
electrometer amplifier: 


C q 

v„ = 22§ 

G C (C + C_) + C„ C, 


s o 


'o G 


where C„ is the external capacitance associated with the reset switch 

Q^, parasitic interelectrode capacitance, and gate capacitance of Q 2 • 

♦ 

Thus, considerable loading effects are present at each tap position 
in addition to a nonlinearity which results from charge sharing between 
C s and the other capacitances. The minority carrier charge q g is not 
disturbed, however, and the signal results from a redistribution of 
charge originating m the clock electrode drive circuitry. Such a 
technique is employed m the split-electrode type of CCD structures. 
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Figure 2 Floating Clock Electrode Sensor Tap 
Circuit for NDRO of Analog Signals 
in a CCD Delay Line 

A reset switch is employed in the tap circuitry of Figure 2 in order 
to avoid long-term drift effects associated with avalanche injection 
charging of the floating electrode. 
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2.2 Multiplying Elements 

Of the various schemes to achieve electrical analog jnultiplica^ 
tion, only three merit serious consideration for integration with CTD 
analog delay lines: MOSFET triodes, difference of squares using 

MOSFET pentodes, and a combined MOSFET/bipolar log-add-antilog approach. 
The bipolar transconductance multiplier is analogous to the MOSFET 
triode, but is much more sensitive to small voltages than MOSFET' s, 

i 

thereby leading to a potential compatibility problem. The multipliers 
are described m reverse order, leading up to the MOSFET triode which 
has been used in all published implementations 2 r , 3, 4 to date. 


2.2.1. Analog Multiplication with Logarithmic Diodes, Addition , 
Antilog-Transistor 


as 


The characteristic of a junction diode is given approximately 


I 


D 



( 1 ) 


where 1^ is the dc diode current, is the dc voltage across the 
diode, I is the saturation current, A is a constant equal to C38.6V) ^ 
at 27°C. This equation can be rewritten as 

V D 

If two currents I Ql and are fed through two separate diodes, 
the two voltages V D ^, across the respective diodes can be added 

V D1 + V d 2 = (l)jin U D1 I D2 /I S 2 )] (3) 


C2) 
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Note that the sum of the voltages is proportional to the log- 
arithm of the product I Dl I D2 . Thus, if we impress this sum across 
the base-emitter junction of a transistor the collector current will 
be proportional to the product I ^ I D 2' because the collector current 
of a transistor is given as 


— I e 
cs 


AV. 


BE = 


I e 
cs 


A (V. 


Dl 


V D2> 


(4) 


where I is a constant and V__ is the dc base-emitter voltage which 
is now equal to + V D2 - A circuit for implementing this logarith- 

mic addition is shown in Figure 8a. The two currents are in opposite 
polarity. If the common point between the two diodes is grounded, 
then the emitter of the multiplying transistor should not be grounded 
to avoid short-circuiting the lower diode D 2 . A current mirror consis- 
ting of Q 2 Q,_ may be used to provide a high impedance return path 

for the emitter of Q^. 

This circuit has some shortcomings. First, the base-emitter 
voltage, being equal to the sum of the two diode voltages, would cause 
an excessively . large collector current to flow if the device sizes of 
the transistor and the diodes are comparable. From equations (.3) and 
(4) , one can derive the collector current 

III 

_ Dl D2 cs 1 

i c - —2 (5) 

S 

If the areas and the doping concentrations of the junctions of the 
diodes and the transistor are comparable, then the I c would be orders 
of magnitude larger than I ^ and I D2 . 

= I D1 T D2 (6) 
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Figure 3 Various 1 ' Log- Add- Antilog' ' Multiplying Circuits 


9 




Since the saturation current I is usually much smaller than the 
operating current I D , the collector current can be enormous . The 
foregoing problem can be solved by using two base-emitter junctions 
across the two diodes, such as the Darlington pair, (Q^, Q 1 ' ) as 
shown m Figure 3b. However, the collector currents of the two 
transistors differ by a factor of g, the current gam; the respective 
base-emitter voltages are: 



(7) 

( 8 ) 


(9) 

Thus, the output current is proportional to the square root of the 
product of the diode currents. For linear multiplication, we should 
make the diode current proportional to the squareof the multiplicand . 
Furthermore, the signal from the nondestructive readout of a CTD 
should preferably be applied to the gate of a MOS transistor. Thus, 

a MOS transistor, operating in the current saturation (pentode) region 

* - 1 * 

with a square-law characteristic, can be utilized to furnish both the 
buffering and the squaring function. 

The buffered multiplier circuit is shown in Figure 3c. Note 
that complementary MOS transistors are used to let the drain current 
flow into the diode in the proper direction. The dc potentials of 
the gates are at ground to ensure pentode operation, so long as the 
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MOS transistors are enhancement-mode devices . 


Another scheme for accomplishing the same purpose is to use 
a pair of complementary bipolar transistors as shown in Figure Jd. 
The output current is then 



In either Figure 3b or Figure 3d, it is desirable to make 
the base current much smaller than the diode currents. Any base 
current would detract from the diode current and cause deviation 
from the ideal logarithmic relationship. For this reason, it is 
desirable to have a high current gain m at least one of the transis- 
tors . 

In practice, this technique involves both fabrication and per- 
formance problems. The manufacturing sequence, must simultaneously 
yield acceptable quality devices of all of the following: CCD's, 
complementary MOS transistors, and complementary bipolar transistors 
(with the required high current gain) is hardly a well developed or 
widely used process. Overlooking such fabrication difficulties, one 
then encounters the functional problems: In Figure 3c, the voltages 

(V , V ) from the CCD include not only the ac signals (x,y) but also 
x y 

biases (x ,Y ) representative of the zero-signal reference level. 
Thus, the instantaneous output from the circuit of Figure 3c may be 
written as 

Z 2 = (X Q + x) (Y q + y) (11) 
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To obtain the finally desired result, one must similarly generate 
and combine three more products ; 

xy = Z 4 - z 3 + Z 2 - Z 1 (.12) 

where 


z 4 - <V (Y o> < z 3 = (x o + x) <V ' z i - <V (Y o + y> 

To perform the electrical function of equation (12) simultaneously 
would require replicating the complete circuit of Figure 3c four times 
with the CMOS FET's of one such replication IDENTICALLY (not nominally), 
matched for all four replications so as to guarantee that the four 
separate sets of (X Q , Y q ) will truly be equal and cancel. That is, 
each such simulataneous four-quadrant multiplier would need the 
following sets of perfectly matched devices: 4 PMOST pentodes, 4 

NMOST pentodes, 4 high-g NPN transistors with matched diodes and 
-independent accessible nodes. 


Another aspect of a correlator device based on the multiplying 
element of Figure 3c is the operation of the two analog delay lines 
with their respective non-destructive readout. For both the CMOST 
pentode buffers. Ml and M2, both their sources and drains are connected 
to low impedance nodes (practically ac virtual grounds) so that 
interaction from the multiplier through the buffer back into the analog 
delay line is negligible. But the desired pentode squaring action 
suggests that V x be biased above ground while be biased below ground, 
thereby precluding the operation of both delay lines with comparable 
biases and clock levels. In conclusion, however, the necessarily more 
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complicated fabrication sequence to obtain fully accessible, high-p 
complementary bipolar transistors as well as CMOST’s is a powerful 
argument against substantial further consideration of the "log- add-' 
antilog" type multiplying element at this time. 


2.2.2 Analog Multiplication by Pentode Difference of Squares 

This multiplier utilizes the square-law characteristic of an 
MOS transistor operating in current saturation (pentode) region. 

The drain current is given as: 

^ = k < v gs-V 2 (13) 

If is made equal to (V + A + B) for one transistor: 

^2 = K2( V A+B ' V T2> 2 U41 

If related signals are similarly applied to MOSFET pentode gates, 
we find 

+ B - V T1> 2 ' I D3 =K3 ,V o +A - V T3> 2 


J D4 “ K4 <V V T4 ! 


C15) 


If four perfectly matched devices are used, (or, as we shall see 
later, if the four voltages are sequenced onto the same MOSFET pentode) 
so that 


K1 - K2 - K3 - K4 - and V T1 - V T2 - (1 

the four contributions may be combined exactly as in equation (2-12) 
to give 


2K • A • B 


T — T JL.’T T 

D4 D3 D2 D1 


(17) 
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The simplicity of this technique is overwhelmingly attractive 
from almost every aspect MOS fabrication of a single carrier type 
is well developed and not complicated. It would be difficult to 
reduce the transistor count to less than a single MOSFET pentode. 
MOSFET pentodes with lightly doped bodies can perform squaring 
efficiently. Ac virtual grounds at both source and drain of the 
pentode eliminate feedback from the multiplier to the CCD with no ad- 
ditional buffering. And finally, pentode squaring can be achieved 
over a fairly large range of biases. The principal limitation is 
implied m equation (14) which shows that the two independent quanti- 
ties to be multiplied must be ADDED (or subtracted) in order to 
enable the multiply operation. This means that a sum of products such 
as n 



K— 1 (18) 

may easily be obtained by this technique PROVIDED the sets (A R ) and 
(B ) are fixed relative to each other since each (A T , , are mixed 

together by addition or subtraction prior to application to the 
MOSFET pentode gate. Consequently, a single pentode sguarer does 
not allow the relative motion of one data set relative to the other 
as needed to generate such FUNCTIONS as correlation or convolution. 

This functional deficiency may be overcome by the use of a 
second complementary MOSFET pentode in series as shown in Figure 4 , 
and connected to the two independent data delay lines as indicated. 
The drain current is proportional to the square of the sum of the 
two gate voltages, provided that the two transistors are in current 
saturation as follows. 
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Figure 4 The CMOS Pentode "Difference of 
Squares" Multiplier 
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When two signals are applied to the gates of two series CMOS 
transistors, the current must be the same. If the transistors are 
in current saturation, the current varies as the square of the gate 
to source voltage. Thus, for two complementary, but not symmetrical 
transistors, with a ratio m for their values of K, we can represent 
and equate the drain currents as 


K CV X - V s ) 2 = K m tv - VlI ) 2 


The solution for the common source voltage is 


V S = 


V ! + m V II 
1 + m 


and the current is 




K m (V J - V I]: ) 
1 + m 


(19) 


( 20 ) 


(21) 


Note -that "the drain current remains proportional to the square of input voltage 
difference, so long as body effect can be ignored. 

This last limitation, however, is quite serious. For conven- 

■ 15 

tional CMOS, where the doping of the PMOST bodies is about 1 x 10 

1 6 

while that of the NMOST bodies is near 4 x 10 , the NMOST body effect 

completely destroys the desired squaring action. Fortunately, the 
NMOST body effect can be completely eliminated by connecting each 
NMOST source to its respective body. The multiplying element of Figure 
4 does indeed need a buffered interface from the two independent CCD 
analog delay lines. Since complementary source-follower action is 
occurring, this multiplying element can introduce interaction between 
the two independent data sets unless buffers are used to isolate the 
gates from the delay line taps. Furthermore, such buffers must be 



able to provide the two independent ac signals with adequate bias 
separation to allow for the sum of the two threshold voltages plus 
the residual PMOST body effect plus the maximum signal excursion. 

Such buffers are described m more detail in connection with 
their design for the triode multiplier, but we take the liberty of 
including some of the results here that are relevant to this CMOS , 
two-pentode, difference of squares multiplier. With both CCD delay 
line outputs at basically the same dc bias, an NMOS source follower 
shifts one signal m the negative direction. This NMOS buffer uses 
a 7 - to 10 - volt bias and needs only 18 iiamps to give better than 
1-MHz bandwidth for a power consumption of 180 y watts. A depletion- 
mode PMOS source-follower using a 5-volt supply and drawing 20u amps 
for a power of lOOy watts also gives a 1-MHz bandwidth for the other 
analog signal buffer. The resultant dc separation between the two 
signals is increased form zero to 4.5 volts. The circuit of Figure 4 
was computer simulated with these results ;Ln mind. The circuit model 
with its operational listing are given m Figure 5 , while the resul- 
tant dc operating points and the output Fourier analysis are shown in 
Figure 6 . Equations (14) through (17) determined the circuit model with 
the sequencing of different signals through the series pentode pair 
(as m equation (16) replaced by the simultaneous or parallel arrange- 
ment shown. The spectral analysis of Figure 11 indicates a complete 
error rejection in excess of 50 dB. Increasing device sizes somewhat 
to assure a safer bias margin gives extrapolated values like lQy amps 
from a 10 -volt supply.. 

Thus, a 1-MHz CTD/CMOS difference of squares multiplying element 
element with about 50 dB error rejection would need Cl 80 + 100 + 100).y 
watts = 0.38 mwatts, and requires only 6 MOSFETs including buffers. 
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furthermore , the pentode squaring action does not require 
extremely critical biases or balancing which we will find to be a 
serious problem with triode multipliers. On-chip, monolithic 
signal-summation from the drain of pentodes (like current sources) 
is significantly easier than summing currents m an on-chip voltage 
-controlled ac-virtual-ground node. The implementation of the needed 
sequencing and an attractive block diagram to perform the above 
multiplication will be presented in a later section. 

Although the preceding description appears extremely attractive, 

two potential problem areas have been submerged therein. First, in 

Figure 5 , one notes the doping levels used for the CMOS fabrication: 

14 -3 

Starting material of 5X10 cm with the complementary isolation tubs 

15 -3 

implanted and driven to give a value of 5X10 cm . Such very light 

doping seems to be the only way to achieve the needed pentode squaring 

16 -3 

by the NMOST devices because typical levels of 4X10 cm give very 
poor pentode squaring. Secondly is the difficulty of obtaining such 
devices for laboratory evaluation of the pentode squaring action, since 
such a CMOS process is a deviation from well developed techniques and 
would require some adjustment of the fabrication steps. 

Consequently, we feel that the CMOS difference of squares approach 
certainly warrants more investigation than computer modelling; but the 
-likely heed -for sbme .fabrication process development suggests it is a 
slightly higher risk approach relative to the triode technique, which 
does not need fabrication development and thereby provides more rapid 
chip development albeit at the expense of chip power consumption. 
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2.2.3 Analog Multiplication by MOSFET Tnode 


The multiplication function can be performed with MOS transistors 
operating m the triode region. The drain current I D of an MOS transistor 
can be approximated as 


J D “ “ W GS 


- V V DS 


+ pv. 


DS 


( 22 ) 


where V Gg as the gate to source voltage, V T is the threshold voltage, 
V DS is the dram to source voltage and (a, 3) are physical parameters 
proportional to the width to length ratio and mobility of the channel 
When the dram to source voltage is small 


X D , “ (V GS- V T )V DS 1231 

If V GS and V DS represent two signals, then the drain current has a 
term (a V Qg‘ V j}g) which is proportional to the product of the two 
signals. If the two signals represented by V G g and V^g are substitu- 
ted m equation (22) or equation (23) there are terms other than the 

2 

desired multiplied outputs such as a v D g' v T or 3/2 ( v D g) - These 

extraneous components of the dram current can be reduced an a balanced 

multiplier as shown in Figure 12. But the signals may contian dc biases 

which must eventually be eliminated. One signal is given as V x sin 6 

with a dc offset V . The other signal is V sin 0 with a dc offset 

a y 

V^. The signal V x + V sm 0 is applied to the common dram ,of QM and 
QR via the buffer (QA, QL) . The signal V + V sin / is applied to the 

jl X 

gate of QM, but only the offset voltage is applied to the gate of QR. 
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The currents of QM and QR can be obtained by substituting m Eg. (22) 
V x + V x sin 0 as V Dg and V y + V y sin 0 as V GS for QM and V y as V Gg fc 
QR- 

13 = IM = a M [ (V y + V y sin 0 - V TM ) (V x + V x sxn 0) ] 

[ (V v + V sine ) 2 1 

M 1 X x ' J (24) 

14 = >IR = a R [ (V y - V TR ) (V x + V x sin 6) ] + 

P R f (V X + V x sin 0)2 ] (25) 

The differential output current I is obtained by subtracting 
equation (25) from equation (24) , assuming identical physical 
parameters (.a =a , 3 M =6 R ) and a threshold difference AV m = V TR 

” V TM : 


I = IM - 'iR = aV V sin 0 sm 0 + 
Du x y 


ajv„V . sin & + (V„ + V sine) (AV m )l (26) 


v x + sin 6 


sin a 


Vy + Vy Sin 0 

_L 


X 


AC Virtual 
Ground, 
Differential 
Current 
Meter 
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Figure 12 Conceptual Singly-Balanced 
Triode Multiplier 





From this last equation, several important conclusions 
can be drawn. First, we see that any mismatch between the nominally 
balanced pair of MOSFET triodes such as threshold (AV T ) , or equiva- 
lently their dc bias, contributes to the differential output a term 
which is directly proportional to the signal applied between drain 

and source of the triode. Next we see that any dc component of the 
drain/source voltage, V , contributes an output term which is 

directly proportional to the voltage difference between the gates 
of the two triodes, (QM, QR) . These two observations lead us 
even further into the question: 

How can a MOSFET triode pair be operated in practice to 
minimize the undesired output contributions fed through directly 
from the input signals m proportion to the extent of unbalance 
between the two MOSFET' s? 

This problem area we shall refer to as the rejection of 
input feedthrough due to a lack of precise or detailed differential 
balance arising mostly from MOSFET array threshold land other) non- 
uniformities, which will be discussed in more detail in section five. 
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3.0 MULTIPLIER BIAS CONCEPTS 


The existing partially-integrated singly-balanced- 

2 4 5 

triode-multiplier correlator chips ' ' have demonstrated the 
need for elaborate routines for operator --control led , off-chip 
optimal bia'smg because of two distinctly different reasons: 
for best triode multiplication, the source to gate bias should 
be much larger than the source to drain bias which sould be as 
close to zero as practical. Consequently, user appeal is much 
enhanced by use of circuits and architecture which automatically 
provides on chip the following: (1) Zero soruce-drain bias and 

balanced source-gate biases for triode multipliers, (2) adequate 
offset for triode source-gate bias or the complementary gates for 
the CMOS pentodes difference of squares multiplier. 

3.1 Triode Self-Bias 

An illustrative example of one way to automatically set 
the bias on a MOSFET triode multiplier is given m Figure 13. In 
addition to the CTD stages propagating the actual data, nominally 
identical reference stages containing only the ZERO signal reference 
levels (e.g. , X o , Y q ) provide two reference levels: (1) Y q sets the 

dc level applied to the gate of the reference triode, QR, to the 
approximate dc value on the gate of QM; (2) X Q sets the dc potential 
(V ) of the "virtual ground" current-summing-node to approximate 
equality with the dc potential applied to the triode drains , V^. It 
is important to note this self-bias concept optimally biases the 
triode multiplier array; that is, over the array, the average unbalance 
between individual sources and drains or between gate pairs goes to 
an acceptably small value. But the concept of Figures by itself, 




Figure 8 MOS Analog Triode Multiplier Array 
Self-Bias Concept 
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does not achieve the detailed differential balance of each multiplier 
within the array needed to give a high rejection of input feedthrough. 

When no .provision is made for any self-bias scheme, ei-ther 
like that illustrated m Figure 8 or some alternative, each such 
partially integrated device ' required IN-SITU biasing and fine 
adjustment. A frequently used technique for such fine adjustment is 
the spectral analysis approach implied in equation [ 26 ]} that is, two 
different sine waves are applied to the device and the output spectrum 
is monitored while the biases are adjusted. The dc value of the vir- 
tual ground summing node is set to minimize the feedthrough of the 
sine wave applied to the gates of the multiplier, thereby yielding 
the minimum average source/drain bias across the array. For singly- 
balanced triode multipliers, as illustrated in Figure 7 and 8 , where 
dc is applied to the gate of the reference triode, QR, the dc gate- 
bias for QR is likewise operator adjusted to minimize the feedthrough 
of the sine wave applied to the triode drains - thus giving the mini- 
mum average offset between multiplier gate pairs (inclusive of 
thresholds) , but not necessarily balancing any pair perfectly. This 
routine for operator optimum biasing must be repeated whenever the 
conditions forthe CTD analog delay line are changed so as to affect 
the signals from the NDRO buffers, if input feedthrough rejection is 
to be maintained at its maximum value. Most practical system appli^ 
cations for such analog/analog convolvers would not be able to 
tolerate such a bias adjustment routine On the other hand, use 

2 4 5 

of such a routine greatly facilitated partially integrated devices r ' 
demonstrating the basic principle of CTD/MOSFET-triode correlator action. 



3.2 Complementary Buffers 


In addition to the preceding feature desired by large 
systems users, another user attraction is ability to operate both 
analog delay lines with comparable voltage levels. 


To permit such operation, the gate and drain buffer 
dc output levels can be used to properly bias the MOSFET triode 
multiplier which needs: small impedance (.large conductance) for the 
drain buffer, minimal bias of drain to source, large bias from gate 
to (source, drain) . We now must add' the consideration of low power 
operation which most strongly affects the dram buffer, since that 
circuit must be able to supply the triode drain current. Assuming 
the simplicity of enhancement mode FET ' s with a CTD NDRO bias of 
V_, one quickly finds that the two options become: 

V3 




Power = I V r 
o C 


Source Follower: g = — 

^m (V g /2)-V t 

Power > 21 ' V_ (for unity gain) 
' — O G 


Inverter ; 


g m = 


21 ' 

o 

V G" V T 


Requiring equal g^ gives us 

V - V 

! i = T . 

O (V g /2) - V t o 

Thus to give the same output drive, with the same gam, an inverter 
consumes more than twice as much power as a source follower , thereby 
dictating the use of a source follower to drive the triode drains. 


Buffer configurations to bias the multiplier triode 
follow directly from the last observation. Source follower stages 
generally shift the output dc level toward the potential of their 
body. Extrapolating these generalities to CMOS circuits, we easily 



see a sxgnal applied to an NMOS source follower xs shifted toward 
its body voltage and away from the voltage of the common PMOS 
body. Tabulated below are the preferred buffer conf xgurations to 
give the needed triode bias: 

Buffer 
Drain 
Gate 


Sxngular MQS CMOS 

Source Follower Source Follower 

Inverter Complementary Source 

Follower 



4.0 DISTORTION ERROR TERMS FOR TRIODE MULTIPLIERS 


Since the computation for the product by any REAL, CTD- 
compatible multiplier is necessarily imperfect, one must describe 
the various possible sources of errors. A very sensitive technique 
to study error source is spectral analysis. The application of 
two distinct sinusoidal signals to an ideal multiplier or correla- 
tor gives a well defined output, especially in the spectral or 
frequency domain. The amplitude of any other frequency components 
in the output (such as harmonics or higher intermodulation products-) 
is a measure of an associated error source in the multipliers. In 
this section, we consider three such "harmonic" sources of error. 

4.1 "Gate Signal" Harmonics form the Drain Buffer 


In Figures 7 and 8 , the common (drain) conductance node 
is driven by a source-follower MOSFET stage with both transistors 
in the pentode mode (characterized by a squaring relationship 
between gate-source voltage and drain current) . In a singly balanced 
multipler, the modulated conductance forces the- source-follower 
drain driver to supply a current related to the signals applied to 
the gate of the modulated conductance. If the transconductance 
of the source- follower drain driver ( 9 *b) is inadequate compared to 
the modulated conductance ^ an erroneous voltage harmonically 

related to the multiplier gate signal is added to the common (drain) 
conductance node. This result may be expressed in formulas as follows: 
Let 




cm 
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be the output impedance of the drain buffer. Then a voltage I 
Rqb would appear at the dram: 


xy 


I R ^ = a V V R-.-., where a =jjC 
xy OB m x y OB' m ox \L/ 


(28) 


When multiplied by the gate signal V f an undesired product current 
is produced: 


1 = % V y V x V y R 0B> • 


C29) 


This current should be much less than the desired product current 

a V V . Since the distortion described by equation (29) is clearly 
m x y 

associated with the signal applied to the gate, even through the 
drain driver is the cause of the problem, let us define a gate signal 


distortion rejection ratio. 



r 

g 


a V V 
m x y 


a V (a V V R^ d ) 
m y m x y OB 


1 


a V R,_ D 
my OB 


or 



< W / L >B • (V GS - Vb 


(W/L) 


m 


V 

y 


(30) 


For the purposes of giving examples, let us assume a desired rejec- 
tion of 40 dB (i.e., 1%). Thus, for example, if one assumes the 
geometric ratio of MOSFET's in equation (21) to be 50 with a typical 
source follower bias (V^_ - V m ) D ~ 2 volts, one obtains 

W = 1 TOlt for r g = 100 • 

4.2. "Drain Signal" Harmonics from the Gate Buffer 


The previous section shows one absolute requirement for 
the buffer used to drive the drain of the triode . The parameters 
of this particular buffer relative to the multiplier triode, the 



buffer's bias and the ac signal on the triode gate, mutually deter- 
mine the gate signal distortion rejection ratio, r , as in equation 
(30) . But, in some cases buffers to drive multiplier gates may 
serve only to provide reverse isolation to prevent signals within 
the multiplier from adversely affecting analog delay within the 
associated CTD. Even though some nondestructive readout schemes 
circumvent such a problem, the following related difficulty can occur. 

Any impedance in series with the gate of the multiplier can 
develop a voltage by virtue of the feedthrough capacitance from the 
drain of the multiplier transistor. If the multiplier gate signal 
is directly derived from the floating gate of a CCD with sensing 
capacitance, C„„, the impedance of the driver is the impedance of the 
capacitance C F( -,. The voltage feedthrough is equal to v x C g ( j m /^pG / 
where C g ^ m is the multiplier triode gate to drain overlap capacitance. 
If the desired signal is V , the rejection ratio limited by gate 
interaction is simply 

VC 

v X = RR(G) 

x gdm (31) 


One way to reduce the adverse effect of the feedthrough 
from the drain to the gate of the multiplier transistor is to place 
a buffer between the floating gate of the CCD and the gate of the 
multiplier. Since the gate buffer output impedance is approximately 
1/(9 ) q B / the feedthrough signal from the drain now becomes 


j C 


gem 


^rt^GB 


(g m J GB 


(32) 
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Thus the gate buffer changes equation (31) to 


V 


RRtG) = 


V 


x 


1 + 


(g m ) GB 


jwC 


gdm 


( 33 ) 


Typically, for C^.^ <_ 0.02 pP and one megahertz, the overlap coupling 
becomes 126 nanomhos, which may be easily buffered by minimal FET 
stages of ten to a hundred micromhos. 


Consideration of the buffer frequency response gives a 
further condition related to that of equation (33) . If GC is the 
gate buffer corner frequency, we have 


<V 


'GC 


GB 


2ir { C + C ) 
v GBM GP 


(34) 


where C GBM is the multiplier gate to body capacitance and C Gp is 
the parasitic capacitance on the multiplier gate node. Combining 
equations (33) and (34) gives 


V 

RR(G) = 

x 



C + r 
U GBM ^GP 

j C , 

J gdm 



(35) 


Equation (35) suggests that self-aligned polygate technology could 
help reduce C gc j m and the transconductance of the gate buffer, 
must be designed to give the needed rejection at the maximum operating 
frequency. Failure to observe this precaution will contribute an 
error term behaving like the square of the drain signal and thus 
becomes almost indistinguishable from the triode nonlinearity of an 
overly large drain signal, which is described immediately below. 


4.3 "Drain Signal" Harmonics from the Multiplying Triode 

To evaluate drain signal distortion rejection ratio, r^, 
we require that the MOSFET multiplier always be well into the triode 
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region. 


Furthermore , 
triode characteristic 

2 (V 


we assume is directly related to that 
; or for the worst case : 


GS V T^ r d (V DS J 


( 36 ) 


m m 

This assumption is very simplistic since the triode drain-gate 
coupling of equation (33) , which causes the same functional distor- 
tion, is not incorporated. At this point we must start including 
quantitatively such array problems as threshold voltage nonuniformity, 
the average value of which we designate by AV^, In predicting the 
worst case for equation (36) we must assume the self-bias technique 
can result in dc errors comparable to AV T . Thus, equation (361 may 
be written 


2 ^0 - v T m - av t - W >r d ( AV t « max ) 

<V 2 > x max + WS< Y o- V T > - d- r„/2) < AVj). , 37 ) 

m 


Consequently, equations (30) and (37) prescribe upper limits on 
signal levels in terms of array nonuniformities, geometry, biases, 
and distortion rejection ratios. 



5. ARRAY THRESHOLD NONUNIFORMITIES 


At the end of section two, we saw how signals applied 
to a balanced triode multiplier were fed directly through to the 
output due to imperfect matching caused by typical array threshold non- 
uniformities . 


5.1 Input Feedthrough Rejection, <J> , for Balanced Triodes 


We must now apply equation (26) to an array of matched triodes 
using the self-bias technique to give an average source-drain dc value of 
AV t with the applied ac signals written as (x,y) yielding: 

^o-V txy + y < AV V + X ( AV T )J 

Taking the ratio of the desired signal to the undesired input feed- 
through from equation (38) gives the rejection, \ p, which limits the 

minimum usable (x,y) : , ..... 

(x . , y . (AV ). 

v min’ ^nmr 1 T 


Thus, for a typical AV T ~ 0.1 volt and a desired input rejection of 
40 dB, one finds y m j_ n ) ~ 10 volts. This result alerts us to a 

significant technical problem: each matched triode pair must undergo 

a detailed balance which self-bias cannot provide if there is to be 
any hope of .achieving acceptable values of input rejection. 


5.2 Compensation by Feedback in Floating Gate Reset 

In this section, consider nondestructive readout only by means 
of floating ijates. If one could selectively address and adjust the 
dc bias on ed‘dh floating gate sensor, then one may postulate the 
following operation. After loading both CCD shift registers with 
zero-signal, reference-level-only charge samples, a "unit one" sample is 



propagated through one of the delay lines . The output of the correla- 
tor to such a "unit one" is a random pattern which measures the non- 
uniformities from stage to stage of the other delay line for the com- 
bination of both the floating gate sensor and its associated buffer. 

If now a feedback loop were closed from the correlator output through 
a selective switch addressing only the stage associated with the "unit 
one" , the feedback amplifier could sense any deviation of the correla- 
tor output from a prescribed fixed level and, thence, adjust the bias 
on that selected floating gate to obtain a null before moving on to 
the next stage. Theoretically, this could be iterated for both channels 
so as to completely compensate for threshold nonuniformities everywhere 
in the array. 

In practice, however, several problems appear likely. The 
published data 2,3 and other estimates indicate that compensation of 
threshold nonuniformities must fall in the range of 40 to 60 dB 
to enable even a remote chance of obtaining any usable input dynamic 
range. Since typical MOS nonuniformities are about 100 millivolts, 

N 

switched feedback control to better than 1 millivolt is thus called for. 
But operation of the selective/address feedback sv/itches introduce 
feedthroughs (even when shielded) due to parasitic and channel coupling 
capacitance comparable to or larger than a millivolt. Furthermore, such 
a feedback scheme requires an exclusive "house-keeping" interval; and the 
compensation stays valid only for short periods of time as thermal 
leakage fixed patterns destroy the feedback settings very rapidly at 
MIL Spec temperatures. In contrast, the sequential multiply process 



continuously compensates for threshold nonuniformities independent of 
ambient temperature to better than 50 dB accuracy, as we see in sub- 
sequent sections. 

5.3 Compensation with the Sequential Multiply Operator 

The use of CTD's to provide analog delay with nondestructive 
readout necessarily requires sensing by means of PET gates with all 
their attendant array threshold nonuniformities . This alone is a strong 
argument for a sequential approach whereby a calculation is performed 
in parts, serially, using the same device elements as well as "zero- 
signal reference levels for each contributory computation, independent 
of the functional form of the multiplying element. Traditionally, highly 
accurate multipliers in analog computing technology used time division 
multiplexing between the signals and selected references. Equations 
(11) and (12) suggest the sequential scheme for any ideal multiplier. 

We now describe the operation in detail and derive some of the extra 
benefits the technique offers for the triode multiplier. 

In the sequential multiplying scheme, all the undesired pro- 
duct terms associated with the dc "fat-zero" or threshold voltage 
deviations are cancelled sequentially in the same multiplying transistor. 
The discrete/hybrid circuitry for accomplishing this time cancellation 
is shown in Figures 9 through 12. In both the X and Y CTD channels, 
samples of signals ' ac-zero reference level only (X Q , Y Q ) and reference- 
plus-signal" | (Xo + x) , (Y 0 + y) j occupy alternate stages. The product 

current (I-) due to the common (Y ) multiplied by X-channel signals is 
balanced against the product current (1+) due to Y-channel signal mul- 
tiplied by X-channel signals. All the incremental (I-)'s are fed to 
a summing amplifier and the (1+) *s to another summing amplifier, to give 


36 




Figure 9 Step One of Sequential Multiply Operation 
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FIGURE 10 Step Two of Sequential Multiply Operation 
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FIGURE 11 Step Three of Sequential Multiply Operation 
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FIGURE 12 Step Four of Sequential Multiply Operation 












the differential output currents, AI (t) = (1+) (t) - (X-) (t) J . 


The signal samples + k~ 1) 7 X ( J + K)'*** are se P arate< ^ 
by common . ...X Q and Y q respectively. To multiply the quantities 
X J + K' and y k' four ste P s are required: The X-channel signals are 

advanced with respect to the Y-channel signal, as illustrated m 
figure 9 . In the first step, the signal in X-channel CTD is transferred 
forward by one stage with respect to the Y-channel. 

The signal sample x (j +k ) is multiplied by the common Y q . The product 
current is: 


N 


I+ (l)= £t« k (Y 0 -V T4 ) (x J+k + V 
k=l 

k 


T X 
, 2, 


(40) 


" V TX ) + P k (X Jtk + V TK ~ V TK } 1 


th 

where k indicates the k multiplier, 

k th 

V m . is the equivalent threshold voltage of the k multiplying 

XT 

MOS transistor. 

k th 

V is the equivalent threshold voltage of the k source follower 

buffer . 


V mv is the equivalent threshold voltage of the common/dummy 
IX 

source opening of switch CS1, as m figure 9 . 


to the 
x (J+k) 


In the second step, the Y-channel is advanced with respect 
X-channel and the signal Y q + Y^ is multiplied by the unchanged 
to give the summation current: 


N 


1 + (2) £ [a I <V - V T^> 

(X J+k + V TX “ V TX ) +|3 k (X J+k + V TX “ V TX } ] 


( 41 ) 



The switches are now set as in figure 10 such that the 


previous value associated with equation (2-63) subtracted from the 
new value equation (41) difference value: 

N 

O + fc (42) 

[I + (2)] - [I + (1)] = L* [a k (y k ) (x J+fc + - V^')] 

k-1 


clamped directionally onto capacitor CB by the operation of the 
switches CS2 and CS3. Note that equation (42) depends on only a 
single multiplying transistor at each multiplier location and is ba- 
sically identical to equation (26) for the balanbed multiplier. 

In the third step shown in figure 11 the X-channel signals 
are advanced with respect to the Y-channel signal. The y-channel signal 
Y c + y^ is multiplied by the common X Q . The summation current, 

N 

I + (3) = £ [“ 7 (Y + V, - VrJS - V™ ) + (V k - V '} 2 1 (43) 

k=l k V 0 ^ T+ 1 TX TX' p k ' IX V TX' J 

gives a third output which is clamped onto CA by means of CS1 as in the 
first step. Prior to the fourth step, read switch RD is closed as in 
Figure 11, so that value stored on CB adds to the output of the pre- 
ceding amplifier. 


In the final step as illustrated in Figure 12 , the Y-channel 
signal may be either advanced, if a completely new vector dot-product 
is desired, or receded with respect to the X-channel, if additional 
dot-products with the same replica vector are desired, as for cor- 
relation or convolution functions . The common X Q and common Y q are 
multiplied. The summation current is 


N 

I + (4) = 2 j> + (Y 
k=l K c 


V T? ^TX 


- ) + 0 k (V^ ) ] 


(44) 
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As in the second step, the value stored on capacitor CA from 
step 3 subtracts from the fourth partial output, to give a second 
difference which is added to the first difference (as stored on 
capacitor CB) for a final four-quadrant Siam of products output. 


V out (J> AI = [ AI(4) - AI(3) + AI(2) - Al(l)] 

N . ' (45) 

= R “k Wfe 

NOTE THAT ONLY THE DESIRED SIGNAL PRODUCT APPEARS AT THE OUTPUT, WITH 
■MULTIPLIER "PURE DRAIN-SOURCE NONLINEARITIES" ALSO CANCELLED. THUS, IN 
THIS MANNER, THE SEQUENTIAL PROCESSOR ENHANCES THE DRAIN HARMONIC 
DISTORTION REJECTION, P d , FOR THE SINGLY-BALANCED TRIODE . 


5 .'4 A Balanced-Input Monolithic Sequential Processor 

A monolithic implementation of the sequential processor 
described above is illustrated in Figure 13. The inherent differential 
action of the 11 fill/spill" input technique is used to subtract the 
applied balanced signals. A built-in bias-charge potential barrier, 

V_, is used to guarantee that balanced signals of either sign can be 

iD 

processed. 

Thus, when charge is metered and read while switch 0 O is 
activated, the quantity of charge is given by Q* = (CIN) © where 

the effective input capacitance is given by CIN = COX + CG2 + CIG 1 . The 
resultant potential increment on the output collecting diode, DOUT , 
and hence, the output capacitance for that node, COUT, is given by 

CIN 

with an associated device gain of G = COUT 
If multiple charge packets (such as N) are metered and collected on the 
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Figure 13 A Monolithic 
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output capacitor (to give the effect of algebraic addition) before that 
node is reset to make the next measurement, the overall voltage 
excursion is limited by the bias applied to the output circuit: V R £= 
reset reference voltage so that: N • (V) <V RR / or v rr >n * G ’ v b* This 
formula establishes a limiting relationship between the device gain, 
built-m barrier, algebraic addition multiplicity, and the reference 
reset bias voltage. 

If the current-summing-to-voltage converters have an effec- 
tive offset or threshold difference designated by A, while the resultant 
output voltages are indicated by (K+) , the sequence of charge packets 
metered on CIN and collected on COUT after the reset operation becomes: 

a) Reset the output capacitor, then activate 0 Q , disable 0 + 
and 0- , repeat the metering operation four times to give Q q = 4Q* . 

: k 

The output of the electrometer at that instant (V Q ) is stored on the 
"difference" capacitor, CD, by momentarily activating the clamp switch 
'via the 0C pulse, in the direction indicated in figure 18, which 

■k 

effectively subtracts V Q from all subsequent signals output by the 
electrometer. 

b) Reset the output capacitor, then activate 0+ , disable 0-and 
to meter 0-^ = 0* + CIN (A + (A+) - (A-) , onto COUT. 

c) Activate 0- , disable 0+ and 0 Q , to add Q 2 = Q* + CIN ( (B~l -A- 

(B+) ) , with Q 1 on COUT. 
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A 


d) Activate 0 + , disable 0 - and 0 Q , to add Q 3 = Q* + CIN 
(A + (C+)-(C-), with {Q^ + Q 2 ) on COUT . 

e) Activate 0 - , disable 0 + and 0 } to add = Q* CIN (D-) - 
(D+) , with (Q x + Q 2 + Q 3 ) on COUT. 

With all four charge packets added algebraically 'on the output capacitor: 

4 

SQ r = 4Q* + CIN • [5 A -5B +5C -SD], where 5K = (K-J-) - (K-). 

^ k (46) 

But the voltage V Q * , derived from the electrometer output at a time 
when Q q = 4Q* had been added on its gate, previously stored on CD 
cancels the contribution from 4Q, leaving only the increment 

16A - 5B+5C ' SD! ’ 

applied to the gate of the clamp buffer. At this instant, the Sample/ 

Hold switch is momentarily activated to update the final output stage. 

In this manner, a fully reconstructed step-wise output waveform, including 
dc restoration, is generated. The clamp or dc restoration part of the 
circuit is optional and may be omitted if ac coupling is permitted. 



5.4 A MONOLITHIC READOUT WITH SELF-BIAS AND DIFFERENTIAL 

CURRENT-TO-VOLTAGE CONVERSION 


The actual multiplying elements need a readout scheme which 
provides DC-balanced virtual ground input nodes, the DC bias of which 
is controlled by a voltage applied to further node. One conceptual and 


two simplistic forms of a single-sided or unbalanced version of such 
a scheme are illustrated in Figure 14 . Since most analog multiplying 
devices consist of counterbalancing elements, yielding currents which 



must be subtracted, an attractive technique gives the differential 
current directly from high impedance current sources that can be added 
in parallel (without interaction) to feed a high resistance for con- 
version of small differential signal currents into large signal vol- 
tages. Use of such a desirable large resistor in the circuits of 
Figures 14-b and, 1-4-c is severely limited, however, because the full 
bias current needed to increase the g of the virtual-ground stage must 
flow through that resistor. Consequently, paired circuits, each like 
that of Figure 14*^b mated with the differential monolithic sequential 
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processor, give smaller differential signal voltages due to the 
smaller allowed resistor; thereby, yielding a reduced output signal-to 
noise ratio (S/N) or requiring a larger area output sequential pro- 
cessor to maintain the same S/N achieved with a higher resistance. 


A possible CMOS circuit to difference the two currents (cancelling 
the DC bias components), so as to enable higher resistances to be used 
is illustrated in Figure 15. The circuit consists partly of a DC 
balanced pair of virtual-ground stages similar to those of Figure 19-b. 
The remainder is the basic current differencing part of the circuit and 
closely resembles a modified current-mirror circuit. That is, the 
currents through transistors (DXCS) and (NOCS) , which are nominally 
matched, try to establish themselves at very near equality 


*D + X P “ + 41 


(47) 


where (I p , I N ) are the signal (IXP, IXN, IY) , plus bias currents 

for the two counterbalancing legs of the multiplier as fed into their 

respective AC virtual ground nodes, AI is a fixed incremental offset 

current to enable readout of bipolar current differentials, and I p 

is the differential readout current. When a complete quadruply balanced 

triode multiplier cell replaces the test sources, (IXP, IXN, .IY) , 

the circuit of Figure 16 results. 


In Figure 16 the transistors to the left of the dot-dash line are 
those belonging to whatever multiplier cell is selected and needs to 
be duplicated for each such multiplier needed within the correlator 
chip. The remaining elements form part of the readout mechanism and 
may be scaled according to the number of multipliers involved, for 
incorporation as a single common readout stage. Some advantages, 
however, may be obtained if the part of the readout stage which provides 
the DC-balanced AC-virtual-ground node-pair, and which is enclosed by 
a dashed box in Figure 15 is distributed along with each individual 
multiplier fKtker than lumped into a common readout stage: The 

resultant closer proximity of each DC-controlled AC-virtual ground 
node to its associated counterparty which includes one of the input 
signals^may give statistically better matching across each multiplier 




Figure 15 A CMOS CURRENT-DIFFERENCING READOUT WITH INPUT CURRENT SUMMATION 
AND OPTIONAL DC- BALANCED, AC-VIRTUAL-GROUND INPUTS 
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as well as some averaging across the array and a more uniform dis- 
tribution of heat generation around the chip. 

We may now briefly describe the circuit operation, assuming the 
bias currents for the AC-virtual ground stages are given by (IB+) : 


I p = (IB+ ) + (IXP ) - (IY) 

I N - (IB—) + (XXN) + (IY) 

Combining the last few equations gives : 


(48) 


I D ~ (AI) + 2 (IY) + { (IXN)-(IXP) >+{ (IB-)-(IB+) } (49) 

For MOS triode-type multipliers, (IXP, IXN) are the drain-source 
currents when a common signal is applied jointly to both drains, so that 
any AC component of such a common drain signal cancels to the extent 
that the matched pair of MOSFET triodes are truly identical. The 
last equation further tells us that the fixed bias current, AI, must 
be comparable to the worst case of unbalance for the virtual-ground 
stage biases , PLUS the unbalance between the nominally matched multi- 
plier MOSFETS, plus the peak signal excursion. Consequently, one may 
ascribe a common-mode rejection ratio (CMRR) to this circuit: 


IN 

CMRR = — (50) 

IX ! OUT 

Indeed, this parameter as well as other measures of performance 
have been modeled using the commercially available ISPICE routine. The 
switch from the computer aided design routines like MSINC to ISPICE 
was made primarily because of the superior convergence of the latter 
routine; and ISPICE also combined the most attractive features of both 
MSINC and older SPICE programs for the pertinent class of problems: 
a) Transistor input specifications are those needed by design/process 
engineers such as dimensions and doping densities, b) Fourier analysis 



of multiple/critical nodes , c) Sensitivity to small threshold non- 
uniformities, d) Tabulation of transistor operating points for the 
circuit quiescent point, e) Swept frequency response to give mainly 
high frequency limitations. The ISPICE program list for the circuit of 
Figure 15 is_shown in Figure 17. 

The CSRO (QOS _Selective Read-Out) program list a£ Fig. 17 illustrates various features 

of both the ISPICE routine, in general, and the CMOS circuit, in 

particular. In addition to the five advantages mentioned above, we 

note that the transistor models may be entered in a parametric form 

so as to accentuate the quantities which the design engineer can 

easily change on his mask set (like channel widths and lengths) or on 

his process sheet (like threshold and body doping.) Thus we can 

very easily compare performance between a conventional CMOS process 

(with approximately two volt enhancement thresholds and body dopings 

of lxlO i5 cm ^ and 4xl6 1 ^cm 3 for the P and N channel MOSFET's, respectively) 

and that indicated in Figure 17 ^ The wafer processing indicated in 

Figure 17 emphasizes low doping densities: The starting N-wafer has 

14 — 3 

a doping of 5x10, cm , which receives a P-Isolation tub implanted and 

15 -3 

driven to give a density of 5x10“ C m , yielding an N-MOST threshold of 
about one volt enhancement. Another P-implant gives depletion mode 
P-MOST’s with pinch-off voltages near two volts. As we have already 
seen in the sections on the multipliers and their buffers, depletion ncde 
P-MDST ' s give better performance with reduced voltage (and power) than enhancement 
mode transistors. Many of the transistors have five micron source/drain spacings 
which start on .the working plates at about eight microns , thereby making pnotoen- 
graving-resolution relatively easy . 

The bias voltages have been selected to allow the sensitivity 
routine for parametric fluctuations on voltage sources to simulate 
threshold nonuniformities across' an array. The applied or input 
signal currents are spectrally separated: AIX = l}iamp at 10 KHz and 

AIY = O.lyamp at 50 KHz, so that { (AIX) / (AIY) } = 10. The spectral 

separation facilitates pinpointing sources of harmonic distortion as 
well as reading the respective output signal componerits for determining 
the CMRR. 
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Figure 17- ISPICE Program List for the CMOS Selective Readout 
(CSRO) Circuit 
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The quiescent operating point of the circuit is shown in Figure 
including the operating status for the MOSFET's all of which are 
well into the pentode region. The power consumption for the circuit 
is also very easily computed from the data of Figure 18. Figure ±y 
shows the sensitivity of the output to such parameter changes as 
device threshold voltages. Let's consider the entries in the table 
under "voltage sources", except for the row labeled "VDD". The column 
labeled "1.00% SENS" gives the voltage change in the output node (48) 
when the named voltage source is changed by 1%, which is 7.5 millivolts 
in this case. Since (V6) is associated with transistor DXCS; the 
table indicates a 7.5 mV change in the threshold of DXCS results in a 
48 mV change in the output node voltage, which is equivalent to 
0.48ua change in current flowing through transistor (NSO) . This 
indication of sensitivity, however, assumes linear variations based on 
the small signal parameters for the circuit at its quiescent point. 

Also included in Figure 19 is a frequency response curve for the "CSRO" (CMOS 
Selective Readout) circuit showing both the output node (48) labeled "1" and a 
key internal node (47) on the gate of the "subtracting" transistor 
(NSO) labeled"2". With the one picofarad loading on the output node, 
as if connected directly to a monolithic sequential processor, the 
circuit appears capable of nearly one megahertz operation. The spectral 
analysis of the output of the circuit, as measured well below its 
cutoff frequency, is tabulated in Figure 20. The simulated drain 
currents for a doubly-balanced triode multiplier correspond to the 
first harmonic, while the fifth harmonic corresponds to the gate 
signal. The rejection of harmonic distortion of the drain signal 
(i.e., the square root of the sum of the squares of the second and 
third harmonics) appears slightly better than 40 dB. The nonlinear 
interaction between the fundamental and the fifth harmonic (gate signal) 
is noticeable, but still very small, in the fourth and sixth har- 
monics . But the most striking feature is the cancellation of the 
fundamental relative to the fifth harmonic to 1/39 in the output 
from 10/1 in the input with an overall common mode rejection ratio of 

CMRR = -20 log (3.328 x 10~ 3 ) = 49.6dB 
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Figure IS The Quiescent Operating Point 
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Figure 19 Sensitivity to Threshold changes and Swept Frequency Response 
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Figure 20: Fourier Analysis of Output 
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Other absolute signal values may also be of interest. The previous 

formula for the output of a perfectly subtracting circuit gives 

AIY} qut - 2{AIY} in . But Figure 20 shows AIY}^ = AVY} QuT /R^ = 131.3 

nanoamps with an input current of O.lya. to give an overall attenuation 

of the difference signal of about 3.7dB. These results depend on both 
the applied voltages and how well the nominally paired transistors 
{ (POA/PIXA) , (NOA,DXA) , (DXCS/NOCS) > are actually matched/ as partially 
illustrated in Table 1. 

The greater sensitivity of the output to array nonuniformities 
(unbalance) in the actual complementary current-differencing section 
(DXCS/NOCS) of the circuit presented a substantial barrier to lower 
power operation as follows. As long as one assumes perfect matching 
for (DXCS/NOCS) r this pair of transistors may be made with an ex- 
tremely large width to length ratio, thereby reducing the associated 
voltage drop for a given current with a resultant lower power. At 
the same time, the greatly increased values for (DXCS/NOCS) make 
the CSRO circuit hypersensitive to threshold unbalances within that 
transistor pair. Indeed, to give acceptable sensitivity in the 
output to array threshold unbalances, not only must the size of the 
(DXCS/NOCS) pair be varied; but the sizes of (NSO) and (NR) must be 
changed, too. 


5-5 A _S ingle-sided Monolithic Sequential Processor 

In this section we continue the nomenclature used elsewhere with 
the "Balanced input" monolithic sequential processor to now describe 
the simplification of the processor enabled by the "CSRO" current 
differencing circuit. Whereas, in the "balanced input" monolithic 
sequential processor, the instantaneous difference between the two 
balanced halves of the multiplier was generated at the primary input 
charge metering position; this instantaneous subtraction function is 
now performed by the CSRO circuit to give £K = (K+) - (K-) with the 



TABLE 1 CMOS READOUT CIRCUIT SENSITIVITY & PERFORMANCE 
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final requirement of the form (SA - SB + SC - SD) . The alternating 
arithmetic signs were achieved in the "balanced input" monolithic 
sequential processor by steering the (K+, K~) data alternately to 
the opposite gates of the charge metering input, thereby requiring 
the addition of four such charge packets to obtain the needed 
combination-. 

The simplified monolithic sequential processor for use with the 
CSRO circuit, on the other hand, is illustrated in Figure 21 and 
requires the addition of only two charge packets to give the needed 
four part combination. The desired operation for the processor of 
Figure 21 follows: 1) Reset the clamp difference capacitor CD by 

activating gate C and reset COUT by pulse activating 0R. Activate 
gate 0 and repeat the charge metering operation twice so as to 
store V proportional to 2Q* on capacitor CD when gate C is deactivated. 
(Q* is the bias charge associated with the built-in potential (v) 
of the input charge meter: Q* = v g- # CIN.) 2) With 0 Q still activated, 
store the initial difference value SA on capacitor Cl and gate Gl. , 

Also reset COUT by pulse activating 0R. 3) Deactivate 0 to apply the 

second difference SB to gate G2 and Capacitor C2. Meter a packet 
of charge, Ql, onto capacitor COUT: Q1 = Q* + CIN * (SA-SB) . 4) Activate 

0 Q to store the third difference value SC on GI. 5) Deactivate 0 Q 
to apply the fourth difference value SD on G2 . Meter a packet of 
charge, Q2, onto capacitor COUT: Q2 = Q* + CIN • (8C-SD) . This 

yields a total charge increment stored on COUT specified by: 

QTotal = 2Q* + CIN * (SA - SB + SC - SD) , 

with an associated voltage appearing at the output of the electrometer 
amplifier stage. But V Q proportional to 2Q* was previously stored 
on CD so as to subtract from all subsequent outputs from the elec- 
trometer. THlife a signal proportional to 

t » 

QTotal - 2Q* = CIN * (SA - SB +SC - SD) (51) 

is sampled and held on the holding capacitor CH by pulse activating 
05 so as to update the analog output data. 




Figure 21 a SINGLE-SIDED (UNBALANCED) MONOLITHIC CHARGE- COUPLED 
SEQUENTIAL PROCESSOR 
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6.0 COMPUTER MODELING 


Various computer-aided-design (.CAD) routines have been 
employed to analyze many circuit possibilities. The most used 
routine is M-SINC (Modular Simulator for Integrated Nonlinear 
Cricuits) , a nonlinear dc and transient simulator of MOS transistor 
circuits developed by T. K. Young and r. W. Dutton of the Electrical 
Engineering Dept, at the Standford University. The M-SINC program 
features a second order MOS transistor model which includes specifi- 
cation of a field dependent mobility as well as three choices for 
the dominant mode of channel depletion. In addition to the transient 
analysis of integrated circuits driven by repetitive pulses M-SINC 
performs Fourier analysis of waveforms at selected nodes as follows : 
For up to 200 points, it predicts the output of the prescribed 
circuit resulting from specified input signals and driving sources . 

A discrete Fourier transform is applied to the time-domain sequence 
of computed output values to give both the phase and amplitude in the 
frequency domain. Purely sinusoidal input signals thus produce well 
defined output spectra with the undesired spectral components pro- 
viding a measure of the error or distortion from the desired function 
Such a spectral analysis technique for both computer and experimental 
modeling greatly facilitates pin pointing the dominant sources of 
errors as already described m sections four and five. Earlier 
"SPICE" CAD routines provide complementary data not readily available 
from M-SINC, such as swept frequency response and sensitivity to 
elementary parameters. 
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Consequently, we present the results of two typical comple- 
mentary models: one by M-SINC on a MOS circuit having all transistors 
of the same type ("Singular MOS") as illustrated in Figure 22, and 
a second one by SPICE for a CMOS version of the "singular MOS" circuit. 
Figure 22 includes only the identifiable circuit elements of a mono- 
lithic configuration and specifically excludes the differential 
current-to-voltage circuit of section 5.4 as well as fictitious 
circuit elements (like one ohm resistors), used to facilitate study 
of the circuits. In describing the following models, we point out the 
significant information from each CAD routine. The salient features 
of the quadruply-balanced triode circuit of Figure 22 will he detailed 
later in this section. 

6.1 M-SINC Models a "Singular MOS" Triode Multiplier 

The M-SINC program list for the circuit of Figure 22 is 
given in Figure 23 with that model given the label "MCISM-?-!' 1 . The 
constituent subcircuits are labeled by function as well as the input 
signal drives. The gate buffers are reasonably simulated with both, 

0.2 volt offset and a 20% gam differential. Since this particular 
computer run is an interim step m tracing the nature and source of 
errors, the triode multiplier source/drain buffer situation is the 
following. A 0.2 volt offset is applied to the source virtual-ground 
buffer; but both drain buffers were not yet changed from their per= 
fectly identical situation, thereby leading to a better than normally 
anticipated rejection of gate input feedthrough. Notice that large 
goemetries are used in the inverters and multiplying FET's to help 
give better dimensional uniformity within an array. The resulting 
quiescent operating point of the circuit is shown in Figure 24 where 
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Figure 23 MSINC Program List for a "Singular MOS" Triode Multiplier (MCISM-7-1) 
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Figure 24 The Quiescent Operating Point for MCISM-7-1 
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we see that all the transistors except the triode multiplying FET’s 
are clearly in the pentode regxon. 

Fxgure 25 illustrates the usefulness of the spectral 
analysis part of the CAD routine. Starting with the gate buffers, 
we see the primary gate signal falls at the fifth harmonic of the 
drain signal with its harmonics fallxng at the tenth, fifteenth, 

«* -r 

etc. multiples of the drain signal , The ratio five exhibits both 
the primary and secondary drain signal harmonics as well as their 
inter-modulation with the gate signal and its harmonics, while 
allowing a relatively inactive harmonic between each grouping of 
active ones. From the gate buffer spectral analyses we see rejection 
of harmonic distortion by the values of 53 to 58 dB and a slightly 
less than unity gain. Indeed the inverting buffer with output node 
(3) and the smaller input bias yields the better harmonic rejection, 
but also slightly less gain. 

The spectral analysis of the source follower buffers is 
also given m Figure 25. The loading of the triode multiplier on 
the drain buffer reduces buffer gain to -7.4 dB, much worse than the 
unity gain inverter. The nature of the virtual-ground source buffer, 
i.e. pentode operation with current proportional to (gate voltage) c 
squared, is very apparent: A fifth harmonic component current flows 

through the virtual-ground node due to a fifth harmonic voltage on 
the gate of the triode multiplier* The finite, nonlinear relationship 
between the virtual-ground node (47) voltage and the current it sinks 
causes the second harmonic of the gate signal to fall on the tenth 
order harmonic and to be as large as - 27.5 dB from the relevant 
fundamental. (The large second harmonic voltage contribution derives 
from the imposition of a "fairly clean" sinusoidal current while 
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Figure 25 MCISM-7-1 Spectral Analysis 
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Fxgure 25 MCISM-7-1 Spectral Analysis (continued) 
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Figure 25 MCISM-7-1 Spectral Analysis (continued) 
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spectrally analyzing the resultant source voltage.) On the other 
hand, with the source follower drain buffer, a sinusoidal voltage 
is applied to the gate while spectrally analyzing the source 
voltage to find a 49.9 dB rejection of the second harmonic in that 
case. 

Finally we examine the spectral analysis of the output; 
both the differential output and a single side of it. The cancella— 
tion of the gate signal component between the two sides results in 
that component subtracting in the differential output to a value 
42.5 dB below its value in one side of the balanced output, as also 
indicated in Figure 25. We must now use this part of the information 
for mating with the differential current to voltage stage described 
in the previous section. Examination of Table 1 shows the "CSRO" 
circuit readily provides subtraction cancellation exceeding 42.5 dB, 
thereby confirming the functional compatability between the (MCISM-7-1) 
and (CSRO-5-5) circuit. But use of the CMOS readout circuit suggests 
simultaneous use of a CMOS multiplier cell (like that described below) 
to take full advantage of the fabrication technology. 

6 . 2 SPICE Models A CMOS Triode Multiplier 

A CMOS version of the multiplier of Figure 22 is given in 

the SPICE CAD program list of Figure 26. As in Figure 23, we point 

out some of the key constituent subcircuits, which are consistent 

with the earlier self-bias scheme suggestion to use complementary source 

followers as gate buffers and depletion-mode PMOS source followers 

as drain buffers and source virtual-grounds. As indicated, a D in 

the name of a transistor shows that it is used in "dummy" or "current- 
« 

mirror" type subcircuits with only one such set of transistors per 
integrated chip in order to establish the various load device gate 
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Figure 26 SPICE Program List for CMOS Inode Multiplier (LCTBR-1) 


LOW POWER CMOS CORRELATOR 

HTTrl SELF 

3ALANCIN3 S RE 

A93UT t LCT3R- 1 1 

i 

DATE 

050577 

CLOCX 

1QSS10 

VERSION 


> 

> 

> 

> 


RN 8C 86 1E6 
RP 50 13 1ES 
RNG 50 8 6 1E6 
RFO 50 3G ISC 
R1 IE SI 1 
R2 16 02 
R3 76 23 
R5 76 05 
C36 36 0 
C96 36 0 
C53 23 Q 
CS7 S 7 0 
CSS 28 0 
Cl 3 18 0 
C53 58 0 
Cl 7 17 0 
C37 57 0 


1 
1 
1 

IF 

IF 

10P 

1QF 

10P 

SF 

SF 

IF 

IF 

IP 

IP 


(]> D designates "dummy" circuits, only one per 
chip to set follower load device gate biases 
much like "current mirror" function) 


— Depletion PMOST multipliers. 


= Complementary source-follower gate buffers. 


Drain buffers and source 
virtual-ground buffers. 


CIS is o 
C 76 76 0 
C3 3 0 1? 

CIO 30 0 IF 

H U 1 91 3 57 0 HFM3 J 
FF? 22 10 17 o FFMD 
MV3 9 3 10 5 7 0 KFMO 
FF5 25 3 17 0 FPHD_ 

HFIYL 3 5 3 5 5 H.N1 
FNIYL 10 53 5 5 MNl 
HO YL 53 S3 5 5 KM1 
KFIYA 7 2 3 5 FN1 
HNIYA 7 37 10 5 HN1 

HOY A 7 1 S3 5 PM . 

HFIXl 16 0 0 0 HFD1 2S 
FNIXL 76 C 0 0 Hr 02 25 
KNOL 57 0 0 D HFD1 25 

FFOL 17 C 0 0 FFQ 1 25 

HDXL 57 C C 0 HFD1 25 

KFIXA 36 'Z IS C MF02 25 
MHIXA 86 66 7 £ 0 HF 01 25 
FNOA 52 1 57 0 KFO 1 25 
HFOA 13 1 17 0 HFOl 2S 
FOX A 58 1 57 C FFD I 2S_ 

MFIXCS 36 53 5 5 MN1 25 
KMXCS 36 £8 5 5 KM 25 
MNOCS 5 3 S3 5 5 K?T1 25 

F r 0CS IS 58 5 5 MN1 25 

HDXCS 53 53 5 5 MM I 25 

•FOOEL FF31 FHC VTC=-2 FHX=.26 GCTA=3E-6 CAHFA=.7 LAFC0A=lS-2 RC:£Ct 

C3S-7E-15 C30=7E-15 CCB=2E-15 C39=.l° CaS-.l 3 ?3 = .5 15=13-15 

• F.CCCL KFFO F'-C VTC=-2 FHI=.S8 GET4=2E-6 CAM{ J { = .7 L AHCD A = IE - 2 RC = £CC 

• P.S=5G3 C35rl.51'18 CG3=1. 52-15 CG3=.lr C33=-l° C3S=.1^> P3=.5 IS=1E-15 

•KCDEL KFC2 FFG VTC=-Z.25 FHr=.E8 3ETA=32-£ U AKEO A - 1 S -2 - r 0 r 5 C C 

*PS- 503 C3S=7E-15 C00=72-1S CG0=2E-15 C23=.I? dS-.l* 3 J=.5 IS=1E-18 
•FCCEL CM MFC VTC-1 FHI=.7? E2T A=6E-E GAKKA=1.£ L#FCDA=2E-3 R0=2CC 

• RS-303 005=72-15 CS3=7E-15 CCa=2C-15 C93 = .2‘> COSr-Z 3 5 15=12-1" 

• KCDEL F*\2 NFO VT0=1.ZE FHT=.77 SET A=6E-E CAHFAiJ.E LAFEGA=EC-3 FD=3CC 
♦RS=303 C 3 5 = 72- 1 S C30-7E-15 CC3=?E-15 C33 = .2^ C9S=.2 P °3 = -5 IS=1Z-15 

• FCOEL FFE1 FCC VTC=1.S FHIr.Se SET R= 32- £ C-AHFAz.2 L AF^O A = 1 £ - 2 

• R3 = 5 (13 RS=500 CGS=7E-1S 053 = 72-15 CG3=2Z-18 C33=.l" C33=-1P P8 = .S IS =12-18 
VR 12 C CC -3 AC 1 00 K 

VYr 2 19 DC 0 5TN 0 100M 800X47 0 
VYK 37 0 DC -3 2 IN -3 -10CF 8CCXHZ 0 
VXP G 19 DC 0 SIN 0 1 Q OH 100KHZ 0 
VXN £6 0 DC -3 SIN -3 -1GCF 1CCXHZ 0 
VI 1 0 -3 
VC C 7 C DC -3 
V9D 5 3 DC -IQ 
VGG 50 0 DC -15 

.OUTPUT VnUTD Ifl 53 PLOT OC HA TR 

• OUTPUT VOLTF IB G FL C T OC FA TR 
•OUTPUT VD IS 0 PLOT OC NA 7R 

. OUTPUT VG 3 0 C LQT DC FA TR 
•OUTPUT V5F 17 0 PLOT OC HA TR 

• OUTPUT VFXCS 3 F C PLOT DC TR 

•OUTPUT VM1 16 91 =LCT OC TR 

• CUTFUT V F 2 16 22 FLCT DC TR 

•OUTPUT VM3 76 23 H OT DC TR 

.OUTTUT V F 5 76 98 PLOT DC 7R 

•DC OT VOUTD VR TC VR 2 -8 . 1 
.t C DEC 3 1PHZ lEaiW 
•TRAN .1U5 10US FOUR VOUTO 100KHZ 

• END 


ORIQJNAC KSS® SS 
OF POOR QUALITY 
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biases. The resultant circuit node quiescent voltages are tabulated 
in Figure 27 with the associated transistor quiescent values given 
in Figure 28, we hotice all transistors excluding th.e multiplying 
triodes are in the pentode operating region except the last five 
NMOSTs , which represent an early attempt at the differential read-' 
out circuit which ultimately developed into “CSRO.” A completely 
balanced model (LCTBR-l) is presented since the early readout scheme 
was very sensitive to array threshold nonuniformities. 

The available SPICE CAD routine provided the swept 
frequency response curves given in Figure 29. Thus, we may now 
correlate the subcircuit upper frequency limits with other criteria 
like power consumption and device geometry. First, we notice that 
the complementary - follower gate buffer appears usable to signal 
frequencies near 2 MHz, while it draws only 17.5pa from a 7 volt 
power supply for a total of 123 uW. with a transconductance of about 
20 micromhos {.consistent with the requirement predicted in section 
4.2). Note that, together, the two types of buffers provide a DC 
bias for the multipler of 4.5 volts to give CV QS - V T 1 =6.5 volts in 
LCTBR-l and 7.7 volts to give (V QS - V T )= 5.7 volts in MCISM-7-1. 

The acceptable harmonic error rejection values indicated in Figures 
25 and 30 show two important points. (1) Harmonic distortion by the 
buffers is acceptably low for all three types: PMOS drain-buffer 

source followers, PMOS gate buffer inverter, and NMOS {complementary) 
gate buffer follower. (2) The self bias provided by the complementary 
scheme gives multiplier DC bias adequate for good linear triode 
multiplication . 
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Figure 27 - LCTBR-1 Circuit Quiescent Points 
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Figure 28 The Quiescent Operating Points for LCTBR- 
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Figure 29 LCTBR-1 Swept Frequency Response 
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Figure 29 LCTBR-1 Swept Frequency Response (continued) 
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Figure 30 LCTBR-1 Differential Output Spectral Analysis 
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Additional data from the two typical CAD models are 
presented in Table 2 for easier comparison and analysis. Quite 
apparent is the reduced power consumption of the complementary 
follower versus the small inverter for providing comparable gate 
buffering. Also shown in Table 2 are the power economies enabled 
by the use of depletion mode PMOS in the CSRO and LCTBR-1 models 
versus the enhancment-mode MCISM-7-1 model. 


6.3 Performance As A Function of Design and Process Parameters 

In the above section, we saw the impact of depletion mode 
versus enhancement mode transistors on power consumption. Also 
implied was a dependence on device geometry. Consequently, further 
specification of performance limits the range of many of the device 
design parameters . 



TABLE 2 TRIODE MULTIPLIER BUFFERS' SPEED AND POWER 


CIRCUIT 


ITEM 

MC1SM-7-1 

LCTBR-1 

Gate Buffer 

PMOS INVERTER 

NMOS KXLCWER 

current, 

54.3 

17.5 

pcwer supply, volts 

20 

7 

pcwer consunption, niW 

1.086 

0.123 

transconductance, micrcmhos 

(22.6) 

20.5 

maximum frequency, MHz 

( 2.2) 

2 

Drain Source Buffers 

PMOS 

FOLLOWERS _ 

Drain Buffer Transistor Name 

Q45 

MPIXL 

geometry (W/L) 

28.6 

(40.34) 

current, /ia. 

100.7 

302.1 

(VGS-VT) , volrs 

1.20 

2 

transconductance, micronhos 

(168) 

301.1 

power supply, volts 

8 

8 

power consunption, mW 

0.806 

2.42 

maximum frequency, MHz 

(2.5) 

5 

Multiplier Triode (W/L) 

1 

(1.1) 

Buffer/Multiplier Ratio 

28.6 

37.5 

Multiplier "Self-Bais" 



(V GS - VT) , volts 

5,7 

6,5 

Source-Current Readout: current 

Ma. 

— 

power supply, volts 

- 

- 

power consumption for half of 



balanced triode multiplier, iriW, 

- 

- 

maximum frequency, MHz 

- 

- 


CSRO 


MPOA 

14 

134.7 

2 

104.5 

5 

0.674 

(1.7) 


151.2 

12 

1.81 

1 


(XXX) indicates data not predicted by CAD routine but extrapolated therefrom. 



6.3.1 Input Dynamic Range 


If one defines the input dynamic range as the ratio of 
the maximum to minimum usable signals as prescribed by the desired 
rejection ratios, then combining the results of the preceding 
sections (i.e., equations 30,37, and 39) give a direct formula for 
the input dynamic range in terms of chip geometry and voltages . It 
may readily be shown that the greatest input dynamic range is obtained 
when the ac drain signal is held constant at its minimum value: 

x = x (constant! = x . - m . (AV m l 

max - A mm T T 

(52) 

We then derive 

y n,ax - c y min = c * (A V <«> 

by substitution into equation (37).: 

*71v t ) ( r d /2 + ?)«(Y o - v T j - (1 + r a /2) ( av t ) 

or, for a singly-balanced triode multiplier, the direct output 
is governed by 

^c- V T m »S< AV T> {*< V 2 +f > + V 2 + • (54) 

Using equation (54) for the case of 0.1 volt threshold uniformity 

* 

and both if> and are to be 40 dB, with £ = 10, gives the result 

(Y - V ) _> 605 volts. This absurd result is further evidence of 
m 

the need for a special technique to ease the rigorous conditions 
imposed by equation (54). One final remark on equation (-S*-^ ) is 
needed here before we show how published experimental array data'*' 
support equation (d*-4 ). Prom Equation (5 4 it is obvious that the 
input dynamic range to a multiplier array, ? , as defined by equation 
(53) is severely limited, at best. The limited input dynamic range 
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of the multiplier array contrasts sharply with the average dynamic 
range of many imaging or simple analog delay CTD's which may some- 
times be as large as 80 dB. That is, the multiplier array cannot 
fully use the typical dyanmic range of the CTD analog delay lines at 
its input. 


An alternate definition for input dynamic range may also 
be considered. Above, the two 'minimum signals are defined as compar- 
able to the array threshold nonuniformities, (Ay^) , since sample-to- 
sample signal increments of that value would be difficult to distinguish 
from the comparable "voltage" increments arising from fabrication 
nonuniformities. The alternate definition for minimum input signal 
is the minimum input signal-increment needed to give signal detection 
at the output with unity signal-to-noise ratio. Thus, over and above 
the array nonuniformity problem (which is likely to be indistinguish- 
able from the signal) ; this definition is more complex and involves 
the specific nature of the output, especially its minimum detectable 
signal, as well as some effective device transfer function. Indepen- 
dent of the fine points of the definition of input dynamic range, we 
must still address the question "can a CTD/MOS triode correlator chip 
not requiring exotic fabrication ever become a practical reality?" 

In applying equation (-54) , we can no longer ignore the 
restriciton of equation (.30) which was not previously incorporated 
into equation (54) . We recall equation (30) relates the maximum ac 
signal on the gate to the gate harmonic distortion rejection^! 
as well as the geometric ratios of the buffer and the triode s 


y 


max 


g 


ir-B 


a m r g 


(30) 
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As derived earlier, the drain buffer is best configured as a source 
follower fed by a current source (CS) , thereby enabling us to give 
some additional useful relationships for equation (.30) . From the 
pentode equation; 


I = 



> 


C551 


it may be shown that 


2 I 


CS 


2 I 


CS 


’mB (V Gg - V T ) B (V GS - V T ) CS 


x 


(W/L) 


1 1/2 


B 


(W/L) 


CS 


or 


^mB ^ C ox 


V) . ('w'l 


B 


V 


cs 


1/2 


^ V GS “ V T ) CS 


Combining equations (30) and (57) gives 


’max 


(V flS V T ) CS 
r g " 


[<w/L) b (W/L) cs l 


1/2 


(W/L) 


m 


0561 


0571 


058) 


Substituting equation (58) into 053) yields a more useful description 
of the input dyanmic range, 5 : 


y 

J max 

y . 
J mm 


1/2 

(Vrs I V T>CS UW/L)r (W/L) C gl 
(av t *r g ) (W/ L ) m 


(59) 


Use of CMOS with depletion mode P-channel for lower power gives 
(V GS - V T ) CS ~ pinch-off voltage ~2 volts; and for AV T ~ 0*1 volt. 
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we find, 


( V GS " V t> 


T'CS 


AW 


20 , 


thereby -leaving the input dynamic range £ dominantly controlled by 
the buffer/multiplier triode geometry and the desired error rejection 
ratios. Having ref ormulated C into a more useful expression, we may 
do likewise for equation (54) , and at the same time incorporate 
additional factors to account for the benefits of the sequential 
multiply operator as well as doubly versus singly balanced operation. 

We define a new set of performance parameters, designated 
with a prime, which specify the desired performance at the output of 
the sequential processor and are related to the corresponding per— 
formance parameters within an equivalent singly balanced triode array. 
Table 3 concisely summarizes such a relationship for modifying formu- 
las (54) and (59) by means of a minimum expected enhancement ratio. 


TABLE 3 

PROJECTED PERFORMANCE ENHANCEMENT FOR A BALANCED TRIODE MULTIPLIER 



Symbol for Value 
After Sequential 

Expected Enhancement 


Equivalent Unprocessed 
Singly Balanced 

Item 

Processing 

Source 

Typical 

Minimum 


Gate Harmonic 

Distortion 

Rejection 

r g 

Double vs 

Single 

Balance 

10 dB 

6 dB 

I’g = ir'g) /2 

Drain Harmonic 

Distortion 

Rejection 

>'d' 

Sequential 

Multiply 

Operator 

10 dB 

6 dB 

r d = Ir;, 1/2 

input Feedthrough 
Rejection 


Sequential 

Multiply 

Operator 

60 dB 

50 dB 

= t’4'') /316 




Substituting from Table 3 into (54) and (59) we find 


( V GS - V T>CS [(W/t >B (^CS 1 lA 
(AV T ) (W/L) m 


r = (632) 


(60) 















( 61 ) 


(Y - V 
o T 

~~ avT 


) 


m 


> 3. 16 x 10~ 3 $ < r d /4 + 0 + r d /4 + 1 


These formulas now enable us to use an attractive 4Q-dB 


rejection ratio for each source of error ^ = I*" = 100^. to 

predict the resultant input dynamic range. For the low power CMOS 

with depletion mode P-channnel buffers, a reasonable range for 

(3f O ' V T > is 5 to 7 volts with an approximate value of 0.1 volts 
m 

for AV , thereby giving an estimate of 


f = 


316 

Y 


Y -V, 
o 

AV„ 


- r./4 - i 
d 


- r a /4 


(62). 


or z, - 82 or 38.3 dB. This result, however, cannot exceed the 
restriction of equation (60) . A typical pinchoff for our depletion 
mode P-channel is about 2 volts, thereby yielding 


[(w/l) b (W/d cs ] 1/2 


(W/L) 


= 65. 


(63) 


m 


6.3.2 Power and Speed versus Size 


Design decisions made at this point strongly influence 
the power consumption per multiplier, since it affects the main 
power users m the multipliers, namely, the source and drain buffers 
The 38-dB input dynamic range arose from a triode bias that can be 
achieved from the same complementary buffers mentioned m the 
description of the pentode squaring multiplier, and partially 
incorporated in the LCTBR-1 model so as to give a 4.5 volt dc 
gate bias augmented by a 2 volt pinch-off. Now for the drain- 
source buffers to meet the condition of equation (63) , we note 
that a 2-volt pinch-off P-channel FET with gate and source con- 
nected conducts about 12 yA for unity (W/L) , so for (W/L) = 65 
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the current becomes 780 yA for a minimum power consumption of 3.9 
mW per buffer. 

An attractive option, however, is to use a smaller current 
source with a larger source-follower buffer. While this saves power 
in direct proportion to the reduction of the current source, the 
size of the buffer rapidly becomes unmanagable, A potential com- 
promise is to reduce the current source by a factor of three while 
increasing the buffer by three times so as to still satisfy equation 
(63). This results in a drain buffer power of 1.3 mW (that is, 

260 yA at 5 volts) while the frequency response of this buffer extends 
to nearly 7.5 MHz. Allowing for a 5 micrometer source drain spacing 
after diffusion, the net transverse dimension becomes 0.0384 inch, 
which is a rather sizable MOSFET. 

6.4 Performance Enhancement Techniques 

So far much of the discussion and derived formation involves 
singly-balanced triode multipliers as illustrated m Figures ( 7 ) and 
{ 8 ) , with occasional references to more highly balanced configura- 
tions as in Figure (22) . As long as the complementary buffers 
provide adequate bias to the multipliers, more distortion errors 
arise from the pentode mode buffers carrying the multiplier currents 
than from the nonlinearities of the multiplier triode itself. 

6.4.1 Double-Balanced Triode Configuration 

In Figures (7 ) and ( 8 ' , the reference 1 triode, QR, is 
given a dc gate bias which is equivalent to creating a fixed conduc- 
tance equal to the dc average value of the modulated conductance, 

QM. Since the currents through the two conductances are differenced. 



the large components of the currents due directly to the signal 
applied to the common (drain) conductance node should cancel, 
leaving only the product current arising from the modulated 
conductances, as implied in equation C26) . Now, if an ac signal 
shifted 180° in phase from the ac component on the gate of QM is 
added to the dc already on the gate of QR, we observe two important 
improvements in the multiplication. 

First is a doubling of the desired product current 
since subtraction of currents through conductances which are modu^ 
lated 180° out of phase gives constructive addition of the two 
components. Note the dc conductances are still the same and help 
cancel a direct feedthrough of the signal applied to the multiplier 
drain. 'But with the second conductance modulated 180° out of phase 
from the first, the incremental currents related to the multiplier 
gate signal nominally cancel at the common (drain) conductance node 
driven by the drain buffer. Or equivalently, in order to apply 
equation (30) , we may describe the balanced (push-pull) gate opera- 
tion as nominally cancelling the effective ac voltage to a much 
smaller value, thereby increasing the rejection of gate signal 
distortion. 

The benefits of such nominal cancellation are clearly 
listed in Table 4 which is derived by MSINC computer-aided-design 
(CAD) modelling of the various multiplier circuits . The four 
parts of the Table give the spectral analysis of the output for 
basically identical triodes, buffers, and biases configured first 
as singly balanced, then doubly balanced, and finally as quadruply 
balanced multipliers. Figure 22 illustrates the important elements 
needed to give (1) self -biasing implied by the "dummy" reference 



TABLE 4 


COMPARISON OF TRIODE MULTIPLIER CONFIGURATION CONCLUDING BUFFERS) 


Configuration (MCISM) 

Single Bal (4-4) 

Double Bal (6-9) 

Quad Bal (7-1) 

Quad Bal (7-2) 

Input. (mV) 

Drain - F D = 60 KHz 
Gate - F g = 300 KHz 

0 DC; 125 AC 
0 DC, 125 AC 

200 DC; 125 AC 
200 DC; 250,-300 AC 

+200, +200 DC, + 125 AC 
200 DC; +250,-300 AC 

+200 DC; 125 AC 
-200 DC, -150 AC 
200 DC; +250, -300 AC 

Output (Fourier Coef) 





(mV) F g -F d 

244 

982 

1023 

1139 

f g + f d 

208 

838 

922 

1028 

RSS (F Q + F D ) 

321 

1290 

1377 

1534 

f d 

174 

851 

834 

929 

f g 

106 

2730 

0 629 

3159 

2f D 

0.704 

2 266 

0 629 

1.823 

2F g 

1 042 

2 908 

0 838 

3.803 

RSS (2 F G iF D ) 

1 937 

3.280 

1 752 

3.879 

RSS (F G +2F D ) 

1 004 

3 486 

.950 

3.900 

Drain Harmonic Rejection 

53.2 dB 

55 1 dB 

66 8 dB 

58 5 

Gate Harmonic Rejection 

49 8 dB 

52 9 dB 

64 3 dB 

52.1 

RSS^F g ,2 F g +Fq) Distortion 

43.3 dB 

49 4 dB 

57.0 dB 

49 0 

RSS (2F 0 ,F g ±2 F d ) 
Distortion 

48.3 dB 

49 8 dB 

61 4 dB 

51 0 

Total RSS Distortion 

42 1 dB 

46 6 dB 

55 7 dB 

46.9 

Input Feedthrough 
Rejection. Drain 

5.3 dB 

3 6 dB 

4 4 dB 

44 

Gate 

9.6 dB 

- 6.5 dB 

66.8 dB 

- 6.3 
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level, X' q , and (2) on-chip product- cur rent-summation at internal- 
voltage-controlled, ac-virtual-ground node with current-to-voltage 
conversion by means of the "grounded-gate" source-input stages 
having the "dummy" reference level X 1 on their gates and operating 
in the pentode mode so that voltage fluctuations on the drains twhich 
are connected in common as indicated in the figure), have negligible 
effect on the virtual ground nodes. In fact, in the CAD simulation, 
the summation busses were biased through. 2QK ohm resistor to convert 
current to voltage with the differential output voltage equal to the 
voltage difference between the two pentode drains connected to the 
summation busses. Unneeded elements were simply disconnected for 

the single and double balanced simulations. 

/ 

In Table 4, all ac values are Fourier coefficients giving 
the center-to-peak value of the sine wave. The input dc va,lues are 
the quantities (X Q - X' Q ) , (X" Q - X' o ) for the drain and (Y q - Y' ) 
for the gate, and are intended to approximate a worst-case array 
threshold nonuniformity which may develop at any individual multi- 
plier due to the self-bias technique or the user/operator bias 
routine described earlier. No such threshold nonunif ormities were 
incorporated m the singly balanced model, so we must assume those 
output results to be the best performance limit. Thus, in the first 
two columns, we are really comparing "best case" of singly balanced 

versus "worst case" of doubly balanced, with the gate signal more than doubled for 
the latter case. 

First, the four fold increase in cutout signal is obvious : 
twice for the larger input signal and twice for the double balancing. 

Secondly, rejection of distortion at (2 f_, 2 f_+ f_)is improved by 

(j G — u 



6 dB despite a more than doubled gate signal and a dc source/drain 
bias, both of which should have rejection of that distortion worse 
in a singly balanced multiplier. 

Consequently, the .doubly balanced version appears to 
have a significant performance adv a ntage of about 10 -dB harmonic 
distortion rejection over the singl e balanced _m.pd.eJL, at no extra 
c o s t, o f . J p , pwen^ . ^or,- , ,c^p2exify^,Mitbj-ri Jb.heJqa.S-i.c. . multiplier cell . 


The true added cost of the better performance is the 
differential input stage with balanced outputs feeding two rather 
than one CTD analog delay line. 

An important comment on achieving the balanced 
or push-pull gate signal is needed here. Use of a differential- 
input balanced output circuit at each and every multiplier location 
has been previously investigated both .at Westmghouse and elsewhere. 
It is our conclusion that such an approach not only adds extra 
complexity, real estate, and power consumption at each multiplier 
location, but also is likely to result in greater nonuniformities 
across an array. A single serial differential-input, balanced- 
output circuit feeding two delay lines with source-follower buffer 
is more likely to give uniform parallel outputs than an array of 
dif-amps (whose balanced outputs resemble the output drains of 
MOSFET inverter stages) with nonuniform inputs and current sources. 

6.4.2 Quadruply-Balanced Triode Configuration 

In the preceding configurations , a single drain driver 


applied a signal to the common drain node of a matched pair of 
conductances which are modulated by a suitably applied gate signal . 




That is, the Complementary) y-related "gate-signal" currents may 
be pictured as circulating in the QM/QR loop of Figure 13, which 
becomes a closed loop by means of the virtual-ground source buffers 
connected to the two triode sources, with only the uncancelled/ 
differential constituent of the y-related current flowing in the 
drain buffer. The next level of balancing requires the addition 
of a second drain driver and, a second set of matched MQSFET triode 
conductances, as shown in Figure 22, with their sources cross- 
coupled to the two previously-used, self-biasing ac-virtual-grounds . 
As with the doubly-balanced triode configuration, the (complementary) 
x-related, "drain-signal" currents may be pictured as circulating 
between the two oppositely-phased drain buffers through the two 
pairs of triodes with only a small unbalanced differential component 
flowing through the virtual-ground source buffers. The second drain 
driver is associated with a second CTD analog delay line transporting 
a signal of opposite phase relative to that in the CTD shift register 
which supplies the first dram driver. As with the balanced gate 
signals, the "push-pu-ll" operation for the two drain drivers is 
easily achieved via a single differential-input, balanced-output 
circuit feeding two CTD shift registers. In this case, however, 
the second shift register was already previously used as the dummy 
reference level element and requires only minimal change to provide 
the needed "push-pull" signals to the two drain drivers. 

Examination of the results in Table 4 for the quadruply 
balanced configuration show nearly 10 dB additional total harmonic 
distortion rejection for the case when the equivalent dc offsets 
are the same and the ac signals are equal and opposite. The more 
likely case is that both the ac and dc components will be different. 
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It may indeed be shown that the input feedthrough rejection of the 
gate drains are opposing. On the other hand, the output signal did 
not double relative to the output of the comparable doubly balanced 
configuration. Thus, Table 4 does not demonstrate an obvious vastly 
superior performance for the quadrup-ly baianced configuration. Con- 
sequently, addi tonal computer simulation and breadboard investigation 
are still needed to evaluate the merits of the quadruply balanced 
multiplier, since each basic multiplier cell contains an extra buffer 
and triode pair not needed in the simpler version. 

6.4.3 Sequential Multiply Architecture 

The general architectural requirements for the sequential 
multiply were presented in Figure 9 through 12. Namely there must 
be interlaced storage for both reference and data in both signal 
channels. For the self-bias of triode multipliers, there must be 
additional" reference-only" cells which are used to set the voltage 
for the ac-virtual-ground current summing. • All these requirements 
have been incorporated in Figure 31, 32, 33 and 34, which may be 
regarded as block diagrams for the various attractive multiplier 
configurations . 

To provide for a valid self-bias reference level needed 
at all times by the singly balanced (Figure 31) and the quadruply- 
balanced (Figure 34) triodes, both configurations use the sample/ 
hold buffered tap described in Figure 2. Fully synchronized analog 
data availability in both the double balanced triode (Figure 32) 
and the dual pentode (Figure 33) permit virtually any type of NDRO 
tap. This feature in the doubly-balanced triode is achieved by 
means of a full "dummy" reference only channel solely for averaging 
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over the array to set the self-bias. In all cases where "push.-pull" 
balancing is used (i.e. all except for the singly-balanced triode 
of Figure 31) , a serial input differential stage with balanced out- 
puts is used to feed two symmetric shift registers for each such 
channel. In such push-pull channels, complementary data are inter- 
laced with reference-only samples and move together through the 
registers as indicated in Figure 32. 33, and 34. 

A final comment on the use of balanced configurations with 
the serial multiply operator: the previously presented theory based 
only on a single (not paired') triode* suggests that such a single 
multiplying element could give a usable output product when processed 
by an ideal sequential multiply operator. The real limitation, 
however, is the accuracy of the four sequential additions/ subtractions . 
Consequently, coarse "geometric" balancing is used in all cases to 
provide an initial approximate cancellation so as to relax the accu- 
racy requirements for the sequential arithmetic. 

6.5 Performance Comparison:' Single Multiplier Versus Multiplier Array 

So' far we have examined many of the components of CCD- 
based correlators as well as some of their interactions. We will 
now discuss the probable characteristics of an array of such com- 
ponents as illustrated m Figure 31. Buffer stages indicated by 
triangles are essentially like those included in the circuit Of 
Figure 22, where the self-bias, ac-virtual-gf ound readout buffers 
of Figure 22 correspond to the triangles containing the algebraic 
signs. While alternate means can be used to achieve the self-bias 
function, the approach implied in Figure 31 accomplishes array 
threshold averaging and is also amenable to the slight change needed 
to incorporate other necessary schemes required for desired perfor- 
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mance • Thus, for the remainder of the technical discussion, the 
architecture of Figure 31 will be the minimally assumed configuration. 

In order to better understand how an array like that of 
Figure 31 operates, we must first describe the behavior of the 
array in terms of the previously described behavior of its constitu- 
ents , 

— Hence, we define an analqg multiplier cell building block 

to consist of those CCD analog delay stages with suitably buffered 
nondestructive readouts appropriately interconnected with multiplying 
elements so as to provide all the necessary simultaneous (parallel) 
analog data to a common Con-chip) output-processing circuit which 
completes the computation of a true four-quadrant multiply Lto 
include dc restoration as needed) and gives a fully-reconstructed, 
step-wise output waveform. 

For example, the circuit of Figure 22 with the addition 
of the associated CCD delay stages forms the building block for a 
quadruply-balanced triode multiplier cell. 

Previously, we observed that any such multiplier cell is 
most critically characterized by spectral analysis of its output. 

The relative amplitudes of the various spectral components of the 
output (when two sine waves are applied inputs) readily pinpoints 
such problem areas as lack of detailed balance, excessively large 
signals or other harmoniq/intermodulation distortions arising from 
inadequate buffer conductance, etc. But, this analysis tool must 
be corrected for use with arrays like that of Figure 31. From the 
CAD simulations presented in Table 4 we recall the Fourier components 
of interest included: f f G , f G + f D 2f G , 2f D , 2f G + fj-,/ and ^D' 
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where f and f are the frequencies applied respectively to the 
drain and gate of the triode multiplier. To see the effect on 
each Fourier component, we first examine an array of ideal multi- 
pliers as illustrated in Figure 1, From sampled data theory we 
have 

x k =x sin 2*^ (t -kTJ 

y k =y sin. 2irf G (t-ltT c ) (64) 

where 1 <_ k <_ N is the index within the array and T is the data 

i 

sampling interval. Hence, the ideal output becomes 

N 

z(t) = X) xy sin 2 tt f D (t-kT c ) sin 2 tt f G (t-kT c ) (65) 

k = 1 

Rewriting , 



Hence, the general summation is of the form 

N 

S = £ cos 2irf (t-kT) (67) 

k = 1 

To simplify the analysis, we approximate the discrete summation of 

equation .(67) by an integral over the same time period, letting g= 

t-kT and dg=-T dk, to obtain 
c c 


t-N T c 

S ~ (-1/T ) f cos 2 it fg dg 
t-T 

c 


( 68 ) 


Thus S = (2-rrfT^) ^ [sin 2ir f (t-T ) - sin 2 tt f (t-N T c ) | 


Rearranging , 


Sa- (Trf T ) 


-1 


sinir f (N-l)T cos 2-rt f (t- 


N+l 


T ) 
c’ 


(69) 
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or 


N 

s = E 

k = 1 


cos 2n f(t-kT )ss - (N-l) 
c 


sin it f (N-l) T 


7T f (N-l) T 


cos 


2 * f(t- ^ T c j 


(70) 


Now we may apply the result of equation (70) to either the ideal 
multiplier array of equation (66) or the more realistic predic- 
tions of Table 4. The output spectrum of such an array will have 
different amplitudes approximately adjusted by the factor 


(N-l) 


sinirf (N-l) 
TTf (N-l) T c 


(71) 


where the array contains N cells and f is the frequency of the 
spectral component of interest. Let us consider a hypothetical 
example where a 10 nsec sample interval is used while the inputs 
to a 32 point array are sine waves with the frequencies, f^=6kHz 
and fg =7kHz. The resultant relative correction factors from 
equation (71) relating array spectra to Average cell spectra become: 


Fourier 

Component 

f G~ f D 

f D 

■ f G ' 

2f D 

! f G +£ D 

! 2f c 

Frequency 

1 kHz 

6kHz 

. 7kHz > 

12kHz 

13 kHz 

‘ 14 kHz 

Adjustment 

1.42dB 

22 . 75dB 

22.54dB 

23. 62dB 

42.58dB 

23. 84dB 


That is, if the average constituent cell had an input feedthrough 
of f D comparable to the sum or difference frequency, in the array 
output that feedthrough would appear to be (22.75-1.42) = 21.33 
dB below the difference frequency output due solely to the sampled 
data correction factor of equation (71) . Thus, formula (71) enables 
diagnosis of array spectra data into that for the Average cell so 
as to pinpoint problem areas of inadequate performance . 
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7 . 0 DELIVERED HARDWARE 


During the course of the contract a few hardware items 
have been fabricated and delivered. In this section, we first 
briefly describe the various delivered items, followed by a summary 
of the published performance data. 

7.1 MOSFET/CCD "Building Blocks" 

To experimentally investigate the various sources of 
distortion and/or offsets as well as other circuit performance 
characteristics, a general purpose array of sixteen PMOSFET's 
were fabricated on a CCD chip. The metallization pattern for 
this array is illustrated in Figure 35.' The MOST’s have a variety 
of width to length ratios (W/L) , indicated by the number adjacent 
to each gate. The following distribution of devices, was selected; 

8 with W/L = 5 
4 with W/L = 10 
2 with W/L = 50 
2 with W/L = 100 

The serial in/parallel out analog delay lines initially used 
were those developed at Westinghouse for use in various CTD discrete 
analog signal processing programs, such as some supported by the 
Naval Research Laboratories (contract number N00014-75-C-0283 and 
N00173-76-C-0147) , and described m the IEEE ISSCC-76 Digest of 
Technical Papers (pp. 194-95) . 
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Figure 35 CCD 
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7.2 Feasibility Demonstrator for Sequential Multiplication 
with Self-Bias 

The "Sequential Mulitply" , schematically pictured in 
Figure 36, is designed to demonstrate a four step sequential 
multiplication of two signals. The exerciser is intended to be 
a feasibility demonstration unit and as such contains all necessary 
power supplies and a crystal clock. The control circuitry for the 
two CCD's and the sequential multiplier is contained on three cir^ 
cuit boards. One board generates all of the digital control signals 
Another board contains the regulators that supply the various voltage 
levels to the CCD and multiplier. The third board is the analog 
processor. 

The digital board shown in Figure 37 has a crystal controlled 
oscillator which provides the basic frequency for the control logic. 
This frequency is divided by counters B and C, Counter B divides 
by 12 and counter C by 16. These two counter address 3 PROMs. One 
PROM (D) provides the timing for the X channel CCD. An identical 
PROM (E) provides the timing for the Y channel CCD, The 
waveforms for the X and Y channel PROM are inverted to provide the 
necessary phase difference between the two channels. The third 
PROM (F) provides the signals needed to do the clamp, sample r and 
difference operations in the analog processor. Two signals (.that 
control the inputs to the CCD's) are decoded directly from the 
counters. 

All of the control signals from the ROM’s are strobed into 
flip-flops (G,H, J,K,L,M) to remove spikes associated with the decoding 
in the PROM's. The outputs of these flip-flops go to drivers which 
convert the logic level signals to higher voltages needed to operate 



1 04 



Y INPUT 


X Y 
OUTPUT 


77 0321 VA 2 


Figure 36 Sequential Multiply 

Exerciser 



h - 1 

O 

Ln 






Figure 37 


Sequential Multiply Exerciser 
Digital Board* 


^ WhA, 

ihMv, 

7/7/76 





106 



Sequential Multiply Exerciser 
Regulator Board 


Figure 38. 
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MOS devices. 


The regulator board of Figure 38 contains all of the 
regulators which provide all of the voltages used by the analog 
processor board shown in Figure 39 . All of the regulators have 
a variable output voltage and are powered by the two main 20v 
supplies . 

The analog processor CAP) board has the two CCD's and 
the sequential multiplier circuitry. This board also contains 
the drivers for the signals that come from the digital control 
board, with passive component carriers illustrated in Figure 40 . 

Each of the two inputs from the front panel to the mul- 
tiplier is conditioned before it is fed into a CCD. Each input 
to the multiplier is passed through a capacitor to remove any 
DC level associated with it. Two CMOS switches are used to switch 
between the input and a reference bias for the CCD. This signal 
is buffered by an op amp and applied to one of the input gates of 
the CCD. Such initial processing simulates the input processor 
sections of Figures 31 through 34 . 

Two output taps from each CCD are used m performing the 
multiplication. One tap from the X channel CCD is used to provide 
a reference to the virtual-ground source buffer. This signal is 
conditioned by op amp K. The magnitude and offset of this 
signal can '‘be changed by two pots. The gain and offset adjustments 
to facilitate simulation of both matched and mismatched quiescent 
points for the triode source and drain buffers. (Due to the Design 
of the CCD outputs, there are differences in gain and offset voltage 
between odd output taps and even output taps , ) The output of op 
amp K is fed to the non-inverting inputs of op amps L and M which 
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provide virtual-ground nodes for the sources of the multiplying 
transistors in the manner shown in Figures 8 and 14-a. The 
signal from the other output tap of the X channel CCD is applied 
to the drains of the multiplying transistors* The signals from 
the Y channel CCD outputs are passed through an RC network to 
introduce a DC offset since ''complementary" buffers for multiplier 
self-bias were not readily available. This provides the needed 
gate to source bias voltage for the multiplying transistors. 

The two current outputs of the multplier are fed to op amps L and 
M where the current is converted to a voltage. The outputs of 
these two op amps are feedthrough resistor networks to op amp N* 

This op amp is set up to eliminate the common mode signal due to 
the X channel reference and to amplify the difference between the 
outputs of L and M which represent the product signal. 

Following op amp N are 2 op amps r 5 switches, and 3 
capacitors which clamp, sample, and difference the signal and 
store the processed signal on Cg . At the beginning of the processing 
sequence, switch F13 is closed and the first product is stored on 
C^. Switch F13 is opened and when the second product is available 
from op am N, the difference of the two products is at the output of 
op amp P. Switches F5 and F12 are closed and the difference of the 
first two products is stored on C g . Switch F13 is closed again and 
the third product is stored on C^. With switches F5 and F12 open, 
switch F6 is closed. Switch F13 is opened and when the final product 
is available at the output of op amp N, the difference of the second 
two products is at the output of op amp P and the result of the 
four products is at the output of op amp R. This result is the sum 
of the two differences. Switch G13‘ is closed and this result is stored 
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on capacitor C g ,, This signal is amplified by factor of ten and 
offset back to ground by op amp T. This signal is available on 
a connector on the front panel of the exerciser. 

The multiplier operates best with a drain to source 
bias of 0 volts. This voltage may be adjusted by the two pots 
that control the offset and gain of op amp K, The gate to source 
bias for the multiplier should be approximately -4 to ~7 volts. 

This voltage can be changed by adjusting the V qj and pots 

located on the regulator board. The voltage on the gates should 
be equal for matched transistors. For poorly matched transistors, 
this voltage may be adjusted to improve operation. The voltage 
used’ in the clamp/sample circuit should be set to 6 to 7 volts 
which is half of the voltage to the CMOS switches. The product 
signal is available on the X Y connector. A sync signal controlled 
by the internal logic is available to sync an oscilliscope to the 
internal sampling rate. One demonstration of the multiplier is 
using sine waves for inputs and observing the sum and difference 
frequencies at the output. Good results may be obtained by using 
a sine wave of 1.5v p-p @200 Hz on the X input and 2v p-p @ 1300 
Hz on the Y input. The effective sampling frequency of the multiplier 
is approximately 10 KHz so input frequencies whose sum is less than 
5KHz should be used to avoid aliasing problems. The input signals 
mentioned are only examples of typical frequencies and amplitudes for 
this particular hybrid model of the sequential multiplier, due 
primarily to the electromechanical construction and not limitations 
from the CCD/MOS circuits used. 
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7.3 A 16-Point-Correlator Hybrid Feasibility Demonstrator 

The objective of the second demonstrator equipment is to model 
as accurately as practical in hybrid fashion the most attractive "Sum 
of Products" device based on the results of all the preceding 
theoretical and experimental investigations, with the constraint that 
the model be available in approximately four months for test and 
evaluation during the summer quarter of 1977. The hybrid approach 
not only permits observation (by monitoring with a scope probe) of 
every node within the "Sum of Products" device model, but also allows 
slight extra flexibility in component selection. Furthermore, 
even minimal monolithic integration of a complete "Sum of Products" 
device would have required substantially more resources than available 
for the hybrid model. This situation led to the following rationale 
and critical design decision concerning the hybrid model. 

A. Analog Delay Lines With Independent Nondestructive Readout 

The previous model of the sequential multiply operator had 
featured Westinghouse CCD's of the serial m/parallel out configuration 
with integrated emitter-follower buffers. These 20-tap devices 
functioned best with fairly complex clock waveforms and initially 
suffered from a symmetry- induced, odd-even fixed pattern. Since more 
than ten uniform taps were' needed, a commercially-available 32-tap 
tetrode-gate bucket-brigade device was selected for the analog delay line, 
so as to permit comparison of BBD performance with that of CCD's 
in view of the potential for simpler fabrication, operation, and 
user interface. 
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B. Multiplying Elements 


Even though the very lightly doped CMOS-pentode difference of 
squares multiplier appears most attractive; such CMOS arrays are 
no.t available either eommere-i-a-liy or i-nterna'lly without the need for 
some mask set and/or fabrication process development, thereby 
forbidding their use m the allowed time period. The next most 
attractive scheme is the doubly balanced MOSFET tnode multiplier 
that needs a nominally identical pair of MOSFET* s. Here, fabrication 
of such 'arrays internally promised to give the desired matched pairs 
in the allotted time while commercial suppliers asked for much longer 
delivery times to achieve the target matching. Indeed, when measured 
at their anticipated quiescent operating points, many of the 
Westinghouse MOSFET pairs matched to within one percent for "resistance" 
as measured on transistor curve tracers. 

i 

C. Buffers 

The MOSFET arrays of the proper type and size (adequately large 
relative to the multiplier MOSFET pairs) were readily available 
commercially and used throughout the hybrid model. 

D. Sequential Multiply Operator 

Even though existing Westinghouse CCD's have sufficient flex- 
ibility to permit a hybrid model of the monolithic implementation 
of the sequential multiply operation, a high probability of success 
in the limited time allocation dictated continued use of the hybrid 
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technique of the first demonstrator because of uncertainty in "debugging" 
the model of the monolithic sequential operator (not previously 
executed experimentally) . 


E. Differential Current to Voltage Readout {viz. CSRO) Circuit 

Strict adherence to the circuit "recipe" given in Figures 15-17 
suggests the use of three more transistors per multiplier comparable in 
conductance to those in the source-drain buffers plus other smaller 
MOSFETS . Thus at least sixteen more "DIPS" would have to be added 
to an already large and crowded hybrid model circuit card. Con- 
siderable experience with the bipolar version of the CSRO circuit 
as well as adequate current handling capability with much fewer 
"DIPS" dictated the use of the bipolar version of this subcircuit. 

The preceding critical design decisions were then incorporated 
into a block diagram essentially identical to Figure 36, except the 
single multiplier was replaced by an array of 16 multipliers. The 
circuits and related wiring details are presented in their entirety in 
the Appendix. 

7 . 4 Published Performance Data 

2 

A partially integrated device with 32 singly-balanced 
triode pairs was operated at a sampling rate of 100kHz with sinu- 
soidal inputs of f D - 6 kHz and £ = 7 kHz. The array was operator 

biased using a spectrum analyzer to simultaneous ly, minimize both input 
feedthroughs as well as the harmonic distortion. For the array 
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operation, the amplitude of the undesired Fourier components fell 
in the range of 30 to 40 dB below the difference frequency. Adjusting 
the array data to reflect the average performance per multiplier 
gives rejections of 10 to 20 dB or equivalently 

* s r a * 6 

The triode gate bias was approximately 2.5 to 4.5 volts. The gate 
signal could not be changed without degrading either 41 or r results 
ing in ? =1 to give an estimated array nonuniformity of A 89 

to 160 m volts, which agrees with typical MOS array results and 
supports the claim of equation C54) . Furthermore, these results 
strengthen the questions: Can CTD/MOS triode multiplierarrays be 

fabricated to give better performance than the marginal values cited 
above? If so, what performance and voltage or power levels can be 
expected? 
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3 

In publications, by Westinghouse investigators ( more 
data was presented ; 6n’ the 'problem areas of the basic multiplier 
building block with emphasis on a serial (or common) technique 
external to the. parallel array, which could relieve at least 
one of the severe restrictions implicit in equation C.54!, Thus, 
the sequential multiply technique relies on four constituent product 
currents for each four quadrant multiply which are generated with 
the same set of MOSFET devices sequentially in time* When these 
constituent product currents are combined algebraically, the effects 
of the nonuniform thresholds nominally cancel completely, thereby 
virtually eliminating the requirement on input feedthrough rejection, - 
ijj, as well as reducing drain harmonic distortion (as shown m 
equations 40 through 45). Other schemes, such as the doubly-ba lanced 
technique already described, also help ease the restriction on 
harmonic distortion rejection, r^. 

Results of the Westinghouse developments are illustrated 

in Figures 41 through 43. Figure 41 shows the spectrum from a 

singly -balanced triode pair before sequential processing. This is the 

. result expected for each singly-balanced triode pair, on the average, 

before parallel summing and sequential processing (jthe latter, to 

cancel effects of threshold nonuniformities and reduce input feedthrough!, 

2 4 

Note here, all published existing partially integrated devices ' 
are equivalent to the parallel summation of a certain number of singly 
balanced triode pairs. Furthermore, m keeping with CCD compatibility 
objectives, two similar but not matched MOSFET triodes from an earlier 
CCD chip test pattern were used to generate the results of Figure 41 
with the following optimal biases giving the best drain input feed- 
through rejection: Y o =V GS =4V, , V SB = Source Body Bias = 11.8V/ v D g = 

51mV, AV g = 142mV, x=y=200mv (pp.) , and y could not be changed without 
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degrading the values indicated in 
drain input feedthrough rejection 
rejection. ' 


the photo tv iF 0 , 8 for the 

and of P 150 for the harmonic 
d 


Despite the less than ideal simulation of nominally- 
matched, CCD-compatible triodes, when similar MOSFET doublets were 
used in the published hybrid model , the results shown in Figures 
42 and 43 were obtained. In the hybrid model the previously 
described self-bias scheme was used rather than the selected bias 
adjustment used for Figure 41. Due to the nonoptimum biases, the 
effective array drain input feedthrough rejection was about —11 to 
-17 dB for a ^ of approximately 0..2. Thus, from a rather poor 
starting point, Figures 42 and 43 show the sequential processor 
achieved a final input feedthrough rejection of 41 dB. for a total 
compensation or cancellation of threshold nonuniformities of nearly 
58 dB., by means of a hybrid sequential processor mounted on convene 
tional wire-wrap boards, and’ implemented primarily with single 
chip monolithic bipolar op amps and complementary MOST switches 
frequently used for special purpose CMOS digital logic * Furthermore, 

the same data just cited for the output prior to sequential processing 

* 

also shows a drain harmonic rejection of only 30dB in contrast to the 
photo of Figures 42 and 43 indicating 41 dB after sequential process^ 
ing. Hence, the sequential multiply operation provides an additional 
10-dB drain harmonic rejection not present in the triode pair by 
itself. 

Even in its initial hybrid implementation', the sequential 

' *: . ■ 

1 , 1 j ’ 4 » 

multiply operation has-been shown to contribute up to 58-dB additional 
input feedthrough rejection and up to 10-dB additional drain harmonic 
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rejection. Due to expected improvements in common-mode rejection, 
sample and hold retention, and noise immunity, these sequential 
processor enhancement factors are likely to be larger for a monolithic 
implementation. Furthermore, Table 4 suggests the use of a doubly- 
balanced triode provides another 10-dB of gate harmonic rejection. 
These improvements in performance may readily be incorporated into 
the equations (54) to nominally correct them from a singly-balanced 
triode without sequential processing to the case of a doubly-balanced 
triode using the sequential multiply operator with the projections 
listed in Table 3, 



( f c + f D l 


(f Q - f D > 

Figure 43 Single Multiplier Output 
Spectra After Sequential 
Processing (Extended Fre- 
Quency Scan) 
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ADDITIONAL FACETS OF THE ANALOG-ANALOG CORRELATOR 


Some aspects of analog-analog correlators are important 
when the devices are used to generate convolution/ correlation functions , 
such as data processing speed or the numerical count of multipliers? 
while others are important for analog matrix arithmetic applications, 
such as erroneous "fixed pattern" contributions, the adverse affects 
of which may be diminished for convolution/correlation function ap- 
plications by special operational modes. This section describes these 
additional facets of CTD analog correlators. 

8.1 Fixed Pattern Noise/Errors 

In most analog signal processing devices, the parallel or 
simultaneous performance of any function on NOMINALLY identical 
device elements, with subsequent combination or serial readout, often 
gives slightly different results from one device to the next due to 
spatial nonuniformities. For the analog correlator, such nonuniformi- 
ties show themselves in two distinct ways: Varying threshold or flat- 

band voltages across an array; AND nonuniform "geometric" parameters 
which involve the local effective widths, lengths, and thicknesses as 
well as the effective charge mobility at each spatial location within 
the array. The problem of nonuniform thresholds is best overcome 
by the sequential multiply operation whereby constituent computations 
of the true analog multiply are done serially on the same device 
element (with subsequent combination) which effectively cancels any 
influence by the threshold voltage on the final result. The sequential 
multiply technique, however, has no effect on the so-called "geometric" 
nonuniformities. Consequently, in this section, we analyze the effects 
of these "geometric" nonuniformities on the performance of the correlator. 
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8.1.1 Inpulse Response and Frequency Spectra of Nonuniformities 
(Or Fixed Patterns) 

The data flow architecture and operating sequences of a few 
more attractive schemes are illustrated in Figures 44,45/ and 46 The 
architecture of Figure 44 is basically identical to that, dis.cussed 
previously: where a multiplier is associated with one pair of analog 

! 

delay stages for EACH channel, wherein one of said pair of stages 
being the nondestructive readout type. This architecture is most 
closely associated with the operating scheme m which a correlation 
or convolution function is generated by holding the set of multiplica- 
tive weights effectively stationary in their respective analog shift 
register while the unknown signal slides by the prescribed multipli- 
cative weight pattern. In Figures 45 and 46- the correlation function 
is generated while both analog shift registers uniformly propagate 
their analog data forward. But now, the analog shift register 
of one channel has twice as many stages as the other channel, thereby 
resulting in a data propagation rate (passing by the multiplier cells) 
of only one-half the rate of the other channel. This gives the effect 
of data in the singly-sampled register "sliding" by the data in the 
doubly- samp led register, even though both are advancing with the same 
dwell time per stage. 

In summary, in the scheme of Figure 44 the set of multi- 
plicative weights appears imbedded in the spatial array during genera- 
tion of the correlation function. Thus, if the relative nonuniformity 

til til 

of the k cell is the error contribution for the k cell is 

given by which remains unchanged for the complete correlation 

function. On the other hand, the techniques of Figures 45 and 46 
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Figure 44 REAL TIME CCD CORRELATION USING Si'OUCNTTAL MULTIPLIERS 
WITH SINGLE CCD ADVANCE (WITH STATIONARY WEIGHTS) 

Number 


Sample Number 
1 


Advance X 
2 

Advance X 

3 (**) 

Advance X 

4 (**) 

Advance X 

5 (**) 


£ 


X 2 


°/*i 


o/x; 


X, 


ro/xr 

L_ 1 


v ri _ x ?] 


a 


fo/x 


o/x: 


X" 1 

3_l 


"5 


START NEXT CORRELATIONj 

6 X”' 


o/x_ 


0/X n 


OR START REPLICA UPDATE: 


-Js- 


8 ( A ") 


-in- 


0/X * 

Jl 

0/X 2 

* 

„. Y 2 

V X 3 

* 

Y 3 


°/x 2 

O/X* 

* 

Y 1 

O/x! 


Y 


2 


L 1 

o/x' 

Y 2 

0/X 5 

Y ? 
o/x! 


*■ Indicate Signal /Kepi i ca Jen Next Generation 
X' arc Previous Signal 

(A’W Outputs aL these Limes give the DELTIC, RtHG, or CYCLIC corre- 
lation, jl the X values be lot; the slash gives conventional 
con el lit ion . 


L 2 3' 




Figure 45 REAL TIME CCD CORRELATION USING SEQUENTIAL MULTIPLIERS 

FOR BATCH PROCESSING OF SIGNALS WITH DUAL UNIFORM SHIFTING 
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Figure 46 REAL TIME CCD CORRELATION USING SEQUENTIAL MULTIPLIERS 

FOR PROCESSING OF REPETITIVE SIGNALS WITH DUAL UNIFORM SHIFTING 
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move or convolve the multiplicative weights relative to the spatial 
location which gives a slightly different spectral treatment to the 
fixed pattern noise. 

To simplify the derivation of fixed pattern noise, consider 
first the impulse response of the device when a unit amplitude impulse 
propagates through the X-channel CTD . In the ideal case, in the ab- 
sence of nonuniformities, the impulse response may readily be written 
as : 


tJ 



where T c = clock time, f NY Q “ Nyquist Sampling limit = l/2T c , t = (N-l)T 
overall shift register delay, f '= 1/t = associated fundamental fre- 
quency of shift register. If the N discrete sample values for the 
Y-channel are regarded as the values of a repetitive waveform of period 

t which is sampled at the rate f = 1/T , the following formulas are 

c c 

obtained (where the index and discrete samples are module N) : 
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We may now consider the more realistic case of array non- 
uniformities , e^, for a device operated as shown in Figure 44 with 
stationary weights to give an ERROR impulse response function: 
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Such a result is entirely expected: For operation with non- stationary 

weights, where they convolve m the time domain through all allowed 
spatial positions, the frequency domain function is related to the 
direct product of the individual frequency domain functions: The 


second is the frequency spectrum of the selected weights; while the 
first is the frequency spectrum of the geometric, fixed pattern errors 


8.1.2 Statistical Evaluation for Devices 

For formula (81) in the case of the nonstationary weights, 
we see the spectra of the filter and of the fixed pattern errors 
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multiply together. This is equivalent to applying the desired filter 
not only to the input signal but the fixed pattern errors, too. But 
indeed, there is a more important side effect of this behavior: 

The geometric fixed pattern errors are probablistic, being described 
by some multivariate probability density function, p(e^ f e ) , which is 
used to predict such experimentally measured parameters as: 

1) mean error = e ] c = E(e k ) (82) 

- 9 Vz 

2) RMS error J = [E(ep] (83) 

3) Error variance = a ^ = Ej(£ k ~7 k ) 2 | = (7 2 ) - (7 fe ) 2 (8.4) 


We may now apply such a statistical evaluation to our pro- 
ceeding error analysis, incorporating a time domain average over a 
block of data of duration equal to the characteristic repetition time, 

NT . We designate <f(t)> = Time average of f(t) over 0 < t < NT . 
c c 


or 


and 


First we consider the case of the stationary weights: 

e K j K . Tkus e(<U*)>) = 2 5k ; 

K=| K= ' 


N N 

^ - - 




K=t J-< 


nI 


K ^ 




a* % 


Since 


we find 


K“ t Ji~i 







1=' 



* which gives the variance: 
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d 


K“t / = * 


( 86 ) 


The impulse response of a filter with uniform weights (that is, h £ 


has Y K 1 * * * * for all 1 _< k <_ N) gives an output dependent only on array 
nonuniformities : 


V«r (<(,>>)= 


M-l 


( 87 ) 


Further assumption that the fixed pattern errors are statistically 
independent between array spatial locations ; that is , 


we Uve 

Vat (< f[ c (t)>j — l\i - [\/a t ( £k)] * 


*K 6 JL ~ 

( 88 ) 


1 

This actually follows directly, because <h £ (t) > has become merely the 

sum of statistically independent variables, all having the same mean 
and variance. 

In reality, of course, some of the factors causing array non- 
uniformities are not statistically independent random variables from one 
spatial position in the array to another; such as dielectric thickness 

and background/body doping which may vary gradually across a wafer. 
Consequently, such statistical dependence affects both variance 
or amplitude of the fixed pattern noise as well as the fine structural 

details of the spectra. For example, fabrication of a mask set incor- 
porating some forms of mirror symmetry may result in an undesirable 
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"odd-even" pattern giving an extraordinarily large spectral component 
directly at the Nyquist sampling limit. 


We may now consider the case of non-stationary weights: 

Because the multiplicative weights are convolved through all possible 
spatia-l locations within the array, we may extend the preceeding develop- 
ment for stationary weights by postulating an "effective" fixed 


pattern error: 


fi 






X-t 


for 0<t±NT c> 


directly resultant from the comparison of the formulas for (t) 

and 0 (t) . Following the same derivation as before yields: 
e 


< 6* (*)>•= ^ 




$ 




or 


^ 22 f • 

L * Jc ( 


Similarly ^ £*£t)>) 2 ^ [si 


~z 


to *7 > 

/ -i in - 1 



so that 



a/ a/ 



/-I 




which yields the 
■variance 




A/ 



j-< <«m 




3 *1* 


)] 


(89) 
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Again, statistically independent errors give the result 


VW(< it) = 



(90) 


Thus, the "effective" error variance from time averaging each multi- 
plicative weight over all N spatial positions is reduced by the factor 
N compared to the case for stationary weights: 

Vdr (< <P e ‘ M>) = [Vai- (< W)] (9i) 

for 

statistically independent fixed pattern errors. 

This improvement with non-stationary weights may be explained 
heuristically : The "effective" error is reduced by the probabilistic 

cancellation of the various independent random values with some 
increasing and some decreasing the desired multiplicative value. 

8.1.3 Further Comparison: Stationary Versus Nonstationary Weights 

The above discussion covers the multiplicative fixed pattern 
error noise, but other items are also affected by the selection of 
stationary versus nonstationary weights, such as errors due to 
nonuniform leakage, chip architecture (like the number of multiplier 
cells needed) , and the times needed to compute the double-sided 
correlation functions indicated m Figures 44 , 45 and 46 all of which 
are compared in Table 5 . 
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Table 5": Comparison of "N-Point" Correlator Modes; Stationary Versus Nonstationary 


ITEM 


STATIONARY WEIGHTS 


NONSTATIONARY WEIGHTS 


Multiplicative Fixed (Array Variance)~(N) • (Single-Point Variance) 

Pattern Noise 


Non Uniform Leakage Accumulation of leakage charge during 

correlation function shifts DC value of 
multiplicative weights. 

Analog S/R Stages One pair for each channel, 

per Array Multiplier 


Signal Process Type 
No. of Multiplier Cells 
Associated Figure No. 
No. of Signal Sample 
Intervals Needed for: 
New Signal Only 

Full Update of Signal 
and Multipliers/Replica 


Nonrepetitive* (Batch) 
N 
1 


2N-1 

3N-2 


Repetitive (Cyclic) 
N 
1 


2N-1 

2N-1 


(Array Variance)/v(Single-Point Variance) 
Array averaging reduces this (error) noise. 

Uniform shifting of weights eliminates 
nonuniform leakage and associated errors. 


One pair for signal channel, 

-£wo pairs for multiplier/replica channel. 

Nonrepetitive* (Batch) Repetitive (Cyclic) 
2N N 

3 


3N-1 2N 

3N-1 2N 


*A double-sided correlation function (as indicated in Figures 44,45,46) has been assumed. 



First, consider the effects of nonuniform leakage in the 
analog shift register which stores the set of multipliers. For the 
case of stationary weights as in Figure 44 the accumulation of 
nonuniform leakage charge during the computation time for the cor- 
relation function is indistinguishable from a change during that 
computation in the stored value of the multiplicative weights. On 
the other hand. Figures 45 and 46 feature uniform shifting of both 
analog shift registers so that leaky spots contribute equal charge 
quanta to all charge packets; thereby eliminating the nonuniform 
charges in multiplier values and the resultant errors. But this 
improved performance does not come without cost. 

Comparison of Figures 45 and <46 with Figure 44 shows some 
of the extra elements needed to give the improved performance. Whereas, 
for the stationary weights mode, each multiplier in the physical 
array needs one pair of analog shift register stages for each channel 
(one stage for reference only, the other stage for "reference plus 
signal") the nonstationary weights mode requires that one of the 
channels incorporate two such pairs of stages per multiplier in one 
of the analog shift registers. Although the architecture is somewhat 
different, the analog sampling rates for both channels for either 
operational mode stay the same. That is, one channel stores double 
samples of the weights, for example, while the other channel stores 
single samples of the signal. Since the two analog delay channels 
are uniformly shifting at the same rate, the batch of signal samples 
"slides" by the batch of weights as in Figure 45 Furthermore, in this 
mode of operation, while the analog sampling of the signal is straight- 
forward; providing double samples of the weights for shifting at the 
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same rate as the signal introduces a new and different analog input- 
interface situation. 

Some additional differences are included in the lower part 
of Table 5 . For conventional double-sided correlations of non- 
repetitive data (as in Figure 46 ) , the use of nonstationary weights 
requires an array of 2N multipliers to perform an N-point correlation, 
in contrast to the other schemes which require only N multipliers. Also, 
more computation time (more shift register advances) is needed for 
the nonstationary weights scheme. 

We may summarize the relative merits as follows: It is 
important to note that computations for analog matix arithmetic 
(involving specific sums of products) uniquely define the needed 
products, that is, a single exactly prescribed alignment of the com- 
ponents of the analog vectors . Thus analog matrix arithmetic applica- 
tions do not allow the choice of using nonstatronary weights versus 

• . 1 
stationary weights. The impulse responses for uniform weights, h (t) 

£ 

and J2f £ (t) , however do serve as useful tools to measure the errors 
due to array nonuniformities and the extent of correlation of those 
nonuniformities, respectively. Applications involving complete 
correlation/convolution functions (i.e., time-sequential streams of 
output data points ) 9 in contrast to those for analog matrix arithmetic 

i 

with a single output in time, have thd option of using nonstationary 
weights to obtain reduced errors from fixed pattern nonuniformities 
in array element parameters (including leakage) at the expense of 
needing somewhat larger arrays and longer processing times. Hence this 
becomes an important consideration for the period of early development 
of a fully-integrated, monolithic implementation of an analog-analog 
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correlator: The architecture needed for the nonstationary weights 

scheme for computing correlation functions can easily be used in the 
stationary weights mode, but the converse is not true; thus suggesting 
that any developmental chip for demonstrating feasibility of monolithic 
implementation and featuring the architecture for nonstationary 
weights operation greatly facilitates actual comparison of the two 
schemes thereby broadening the spectrum of possible users. 

8 . 2 Speed/Multiplicity Enhancement by Multiplexing 

For various applications, either the number of multipliers 
required or the data processing rate or both may be so high that 
appreciable degradation of data occurs before the desired multiplication 
takes place due primarily to poor charge transfer efficiency (CTE) . 

Even with extremely bad CTE, the signal attenuation per stage is so 
.small that geometric pre-emphasis by means of tapering the size of the 
multiplying element larger, as the signal progresses through the delay 
line, is not feasible. Furthermore, even if a correlator chip based 
on floating gates and buried channel CCD were to give good error 
re 3 ection and a large input dynamic range at very high speeds with 
large numbers of stages, new application areas most certainly would open 
with more demanding requirements. Consequently, techniques to multiply 
the effective speed and number of stages by means of multiplexing 
deserve some consideration from the outset, if for no other reason 
than to guarantee that multiplexing is not rendered impossible because 
of an accidental oversight. 

The four-step sequential process lends itself naturally to 
either a 4:1 or 8:1 multiplexing ratio. A likely block diagram for 
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4:1 multiplexing of doubly-balanced triodes is illustrated in Figure 47 
with emphasis on the clocking simplicity due to the symmetric nature 
6f the sequential multiply. Each "quadrant" of the large composite 
chip is immediately recognized as the block diagram of Figure 32 , 
-including charge-coupled sequential processor with fully reconstructed 
output waveforms. Although all quadrants are computing at a quarter 
of the total data rate, they are phased 90° relative to each other so 
that a new computation is added into the output at every quarter of 
the reduced clock rate, i.e., at exactly the total effective data 
rate. Thanks to the sample and hold output within each constituent 
subarray (quadrant) , the passive output summation scheme illustrated 
in Figure 48 may be used. Difficulty with such a passive scheme 
(i.e., no active selective address switching, only pentode current 
addition) most likely would involve subarray output amplifier matching. 
Gain matching may be approached by using large dimensions for the 
MOSFET's. Threshold nonuniformities add to give a constant dc offset 
which can be removed by a final clamping or dc restoration if ac coupling 
is not allowed. 
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Figure 47. Multiplex of Doubly -Balanced Triode Multiplier Arrays 
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9) SUMMARY AND CONCLUSIONS 


The information presented in the preceding sections describes 
almost every important aspect of analog-analog correlators or con- 
volvers based on CTD's. An elementary investigation of various mul- 
tiplying elements and their marriage with CTD analog delay lines 

3,5 2,4 

was experimentally verified in both hybrid and partially integrated 
versions, all of which served very useful functions by demonstrating; 

1) The successful marriage between CTD’s and triode multipliers. 

2) The need for elaborate routines for operator-controlled 
off-chip optimal biasing (via spectral analysis) , thereby emphasizing 
the necessity for an on-chip self-bias technique. 

3) Uncompensated- array error rejection limited by threshold non- 
uniformities to give a per multiplier input feedthrough rejection 
not exceeding 15dB. 

4) Power levels of three to five milliwatts per buffer. 

Since the difficulties highlighted by the last three items above had been 
(anticipated from the outset, contract work was strongly focused on 
developing solutions to those problems as well as others related to 
making such devices attractive to users so as to achieve a larger potential 
market in accordance with the objectives presented m the introduction. 

As a direct consequence of the analog-analog correlator program, many 
more broadly applicable techniques have been developed and specialized 
to CTD correlators. A partial list of some of the more significant 
contributions follows : 

1) The sequential multiply operation permits the joint operation 
of arrays of multipliers by providing detailed compensation for array 
threshold nonuniformities and cancellation for some harmonic distortion, 
too . 
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2) Relaxed fabrication uniformity tolerances for nominally 
matched circuit elements, because the objective of the nominal 
matching is merely to help ease the requirements on the serial sub- 
traction steps in the sequential multiply operation. 

3) The seif-bias technique provides for the multiplying elements 
both the needed balanced and biased drives by means of complementary 
type circuits and reference sources incorporated within the device 
architecture so as to eliminate the need for operator optimized biasing, 
while simultaneously enabling essentially identical operation for both 
analog delay channels . 

4) A CMOS pentode multiplier cell featuring very low power con- 

sumption with reduced sensitivity to bias and threshold problems, but 
requiring some additional fabrication technology development beyond 
today's commercial CMOS process. 

5) Formulas relating correlator design (sizes) , operation (biases) , 

< 

and performance (error rejection, power, speed) which facilitate trade- 
offs or optimizing such devices for the intended user. 

6 ) A simplified MOS implementation of a current differencing 
function capable of providing balanced ac-virtual-ground current-input 
nodes with optional external dc voltage control and either current or 
voltage readout. 

7) A monolithic, MOS/CTD design for the sequential multiply 
operator. 

For the most part, these contributions (except numbers 4 and 7 
above) have been experimentally implemented and feasibility/veracity 
was demonstrated. Consequently, all the above results may be applied 
to a typical problem to illustrate how to use the findings of the 
investigation . 
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Suppose an analysis of system requirements shows a need 
for submegahertz multiply rates with 40dB rejection of input feed- 
through and harmonic distortion but also with an input dynamic range 

i 

of only 30dB for each multiplier cell in the correlator. Substituting 
these values into formula (61) yields the result 


Y 


o 



AV„ 


> 43.9 


Thus a typical value of Y q - ~ 6 volts (as from Table 2) gives 

x m 

the fabrication requirement of AV^ < 0.137 volt, which is quite rea- 
sonable. This result is then substituted into equation (60) to 
determine geometric ratios versus current source (CS) drive. For the 
assumption implied in table 2 (i.e., a two volt pinch-off, grounded 
gate) the result becomes | (W/L) B (W/L) cs \ 1//2 ^ 34 2 

Since these results so closely approximate the values given in Table 2, 
we know the doubly-balanced triode multiplier cell just now calculated 
can handle the deisred multiply rates. Indeed, for the next step of 
considering power consumption, we take advantage of the similarity 
between the desired multiplier cell and those described in Table 2. 

When the architectural requirements of the various multiplier 
cells are applied to the data of Table 2 and Figure 6 , one obtains 
the comparison of Table 6 . The two doubly-balanced triode multiplier 
cells are basically the doubly -balanced versions (rather than quadruply 
balanced) of the circuit of Figure 16 . The difference of squares balanced 
pentode cell is shown m Figure 33 with buffers like those of Table 2; 
except the basic follower buffer can be much smaller, since it drives 


1 41 



142 


Table 6 Projected Performance Comparison for Correlator Multiplier 

Cells With 40-dB Rejection of Input Feedthrough and Harmonic 
Distortion, and 1 MHz Operation. 


Technology 

Singular MOS Depletion Mode 

_3 

CMOS Depletion Mode P-Channel with 5E14 cm 
Substrate, 5E15 cm - 3 Isolation, Tub 

Circuit Configuration 

Doubly Balanced Triode as per 

Doubly Balanced Triode 

Difference of Square 


Figure 37 

as per Figure 37 

Balanced Pentodes as 
per Figure 38 

Basic Follower Buffer 



• 

Number needed 

3 

3 

2 

Voltage supply 

5 

5 

5 

Current, a 

135 

135 

18 

Power, mW 

2,025 

2.025 

0.18 

Complementary Buffer 

Inverter 

Complementary Follower 

Complementary Follower 

Number needed 

2 

3 

1 

Voltage Supply , 

15 

7 

7 

Current, y a 

54 

18 

18 

Power, mW 

1.62 

0.252 

0.126 

Multiplying Element 

Matched Triode Pair 

Matched Triode Pair 

Dual CMOS Pentode Pairs 

l 

Voltage 

0 

Q 

15 

Current, ya 



5 

Power , mW 

0 

0 

0.150 

Differential Current to 

(CSRO Circuit not usable because 

15 (per leg) 

2.5 

Voltage Circuit 

it needs CMOS Process) 

12 

22 

Incremental Current, ya 

' 2.25 

(2) (7) (0 . 135) + (12) (0.03) 

Voltage Supply 



(2) (/) (U.UU5) + (22) 

Power, mW 


= 2.25 

(0.0025) 
= 0.125 

Total Power, mW 

5.895 

4.527 

0.581 

Relative Power 

10.1 

7.8 

1.0 




only a gate with no current drive or conductance ratio requirements. 
Likewise the differential current to voltage circuit need only subtract 
the much smaller pentode "square" currents. Immediately apparent 
from Table 6 is the effect on power consumption of the requirement by 
the triode multiplier that the basic follower buffers provide much 
lower impedance than the triode multiplying elements in order to behave 
like voltage sources controlling the triode multipliers ' source-to-drain 
voltage. 

Table 6 shows the difference of squares balanced pentodes 
multiplier cell to be very attractive for various reasons: Not only 

is the power consumption much lower than for triode cells of comparable 
performance, but the required number of transistors and active semi- 
conductor area are also substantially less. Another advantage indirectly 
implied m Table 6 results from the pentode operating mode for every 
transistor, thereby providing reduced sensitivity in the multiplying 
function to both bias and threshold changes. On the negative side, 
the difference of squares balanced pentode multiplier cell involves 
two uncertainties : First is the fabrication requirement for doping 

levels substantially below those presently used by commercial suppliers. 
The second uncertainty is the lack of predictive information or formulas 
describing the dynamic range of the multiplication function and re- 
lating it other parameters (as equations (60) and (61) do for triode 
multipliers) because of the unavailability of either devices or data as 
per the specification of Figure 6 . 

Table 6 also helps with additional conclusions concerning 
triode multiplier cells. While CMOS fabrication provides somewhat lower 
power by virtue of the complementary follower gate buffer, the singular 
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MOS approach gives simpler fabrication with some tradeoffs available 
to lower power consumption. One such compromise is to replace the 
higher power inverter with a lower power source-follower, which is 
then likely to require operating the two analog delay channels with 
different effective biases in order to obtain the minimal biases 
needed by the multiplier. Another observation which is true for all 
the schemes shown in Table 6, but which most strongly affects the power 
consumption of the singular MOS approach, concerns the use or omission 
of the ion- implant step to give depletion mode operation. Enhancement 
mode circuitry requires increasing the voltage supply values by an 
amount equal to the sum of the depletion pinch-off plus enhancement 
threshold times the number of series transistors so changed. For 
example, changing from a 2 volt pinch-off to a 2 volt enhancement threshold 
for a simple inverter of two series transistors gives a probable need 
to increase the supply voltage by (2) (2+2) =8 volts or more to allow 
for a larger reverse-bias source-body effect. Finally we note the 
simplified current differencing technique of Figures 15,16,17 and 18 
is not practical for singular MOS devices, thereby requiring a more 
complex approach as indicated in Figure 13 . 

We now summarize the preceding conclusions. For CTD analog- 
analog correlators or convolvers, many design rules and formulas as well 
as new techniques have been developed > and experimentally demonstrated. 

When provided performance requirements, the design formulas give 
limiting values to key circuit parameters, with allowances incorporated 
for the specifics of the fabrication process. Consequently, the techniques 
described herein may be applied to the results of a survey of the 
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requirements of potential users of such a device in order to design 
(or at least specify) one or a limited set of such CTD analog-analog 
correlator devices , which are attractive to the largest practical 
market. 
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12.0 APPENDIX A 


A 16-Point - Correlator Hybrid Feasibility Demonstrator 
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